Probability Surveys 

Vol. 1 (2004) 72-172 

ISSN : 1549-5787 

DOI: 10.1214/154957804100000033 

Large deviations and stochastic calculus 
for large random matrices* 

Alice Guionnet 

UMPA, Ecole Normale Supérieure de Lyon, 46, allée d'Italie, 69364 Lyon Cedex 07, France 
e-mail: Alice.GUIONNET@umpa.ens-lyon.fr 

Abstract: Large random matrices appear in different fields of mathematics 
and physics such as combinatorics, probability theory, statistics, operator 
theory, number theory, quantum field theory, string theory, etc. In the 
last ten years, they have attracted a lot of interest, in particular due to a series 
of mathematical breakthroughs allowing for instance a better understanding 
of local properties of their spectrum, answering universality questions, 
connecting these issues with growth processes, etc. In this survey, we shall 
discuss the problem of the large deviations of the empirical measure of 
Gaussian random matrices, and more generally of the trace of words of 
independent Gaussian random matrices. We shall describe how such issues 
are motivated either in physics/combinatorics by the study of the so-called 
matrix models or in free probability by the definition of a non-commutative 
entropy. We shall show how classical large deviations techniques can be used 
in this context. 

These lecture notes are meant to be accessible to non-probabilists 
and non-free-probabilists. 
Chapter 1 

Introduction 



Large random matrices have been studied since the thirties when Wishart [132] 
considered them to analyze some statistics problems. Since then, random ma- 
trices appeared in various fields of mathematics. Let us briefly summarize some 
of them and the mathematical questions they raised. 

1. Large random matrices and statistics: In 1928, Wishart considered 
matrices of the form $Y^{N,M} = X^{N,M}(X^{N,M})^*$ with an $N \times M$ matrix $X^{N,M}$ 
with random entries. Typically, the matrix $X^{N,M}$ is made of independent 
equidistributed vectors $\{X^1, \dots, X^N\}$ in $\mathbb{C}^M$ with covariance matrix $\Sigma$, 
$(\Sigma)_{ij} = E[X^1_i X^1_j]$ for $1 \le i,j \le M$. Such random vectors naturally appear 
in multivariate analysis, where $X^{N,M}$ is a data matrix, the column 
vectors of which represent an observation of a vector in $\mathbb{C}^M$. In such a 
setup, one would like to find the effective dimension of the system, that 
is the smallest dimension with which one can encode all the variations of 
the data. Such a principal components analysis is based on the study of 
the eigenvalues and eigenvectors of the covariance matrix $X^{N,M}(X^{N,M})^*$. 
When one assumes that the column vectors have i.i.d. Gaussian entries, 
$Y^{N,M}$ is called a standard Gaussian Wishart matrix. In statistics, it used 
to be reasonable to assume that $N/M$ was large. However, the case where 
$N/M$ is of order one is nowadays commonly considered, which corresponds 
to the cases where either the number of observations is rather small or 
the dimension of the observation is very large. Such cases appear 
for instance in problems related to telecommunications, and more precisely 
in the analysis of cellular phone data, where a very large number of 
customers have to be treated simultaneously (see [23, 116, 120] and references 
therein). Other examples are provided in [80]. 

In this setting, the main questions concern local properties of the spectrum 
(such as the study of the large $N, M$ behavior of the spectral radius of 
$Y^{N,M}$, see [80], the asymptotic behaviour of the $k$ largest eigenvalues, 
etc.), or the form of the eigenvectors of $Y^{N,M}$ (see [120] and references 
therein). 
2. Large random matrices and quantum mechanics: Wigner, in 1951 
[131], suggested approximating the Hamiltonians of highly excited nuclei 
by large random matrices. The basic idea is that there are so many phe- 
nomena going on in such systems that they can not be analyzed exactly 
and only a statistical approach becomes reasonable. The random matrices 
should be chosen as randomly as possible within the known physical re- 
strictions of the model. For instance, he considered what we shall later on 
call Wigner's matrices, that is Hermitian (since the Hamiltonian has to be 
Hermitian) matrices with i.i.d entries (modulo the symmetry constraint). 
In the case where the system is invariant by time inversion, one can con- 
sider real symmetric matrices etc... As Dyson pointed out, the general idea 
is to choose the most random model within the imposed symmetries and 
to check if the theoretical predictions agree with the experiment, a dis- 
agreement pointing out that an important symmetry of the problem has 
been neglected. It turned out that experiments agreed exceptionally well 
with these models; for instance, it was shown that the energy states of the 
atom of hydrogen submitted to a strong magnetic field can be compared 
with the eigenvalues of an Hermitian matrix with i.i.d Gaussian entries. 
The book [59] summarizes a few similar experiments as well as the history 
of random matrices in quantum mechanics. 

In quantum mechanics, the eigenvalues of the Hamiltonian represent the 
energy states of the system. It is therefore important to study, following 
Wigner, the spectral distribution of the random matrix under study, but 
even more important, is its spacing distribution which represents the en- 
ergy gaps and its extremal eigenvalues which are related with the ground 
states. Such questions were addressed in the reference book of M.L. Mehta 
[98], but became even more popular in mathematics after the work of C. Tracy 
and H. Widom [117]. It is also important to make sure that the results obtained 
do not depend on the details of the large random matrix models 
such as the law of the entries; this important field of investigation is often 
referred to as universality. An important effort of investigation was made 
in the last ten years in this direction, for instance in [28], [52], [76], [89], ... 

3. Large random matrices and Riemann Zeta function : The Riemann 
Zeta function is given by 

$$\zeta(s) = \sum_{n=1}^{\infty} n^{-s}$$

with $\mathrm{Re}(s) > 1$ and can be analytically continued to the complex plane. 
The study of the zeroes of this function in the strip $0 < \mathrm{Re}(s) < 1$ furnishes 
one of the most famous open problems. It is well known that $\zeta$ has trivial 
zeroes at $-2, -4, -6, \dots$ and that its non-trivial zeroes are distributed symmetrically 
with respect to the line $\mathrm{Re}(s) = 2^{-1}$. The Riemann conjecture is that 
all the non trivial zeroes are located on this line. It was suggested by 
Hilbert and Polya that these zeroes might be related to the eigenvalues 
of a Hermitian operator, which would immediately imply that they are 
aligned. To investigate this idea, H. Montgomery (1972), assuming the 
Riemann conjecture, studied the number of zeroes of the zeta function 
on $\mathrm{Re}(s) = 2^{-1}$ up to a distance $T$ of the real axis. His result suggests 
a striking similarity with the corresponding statistics of the distribution of 
the eigenvalues of random Hermitian or unitary matrices when $T$ is large. 
Since then, an extensive literature has been devoted to understanding this relation. 
Let us only point out that the statistical evidence of this link can only 
be tested thanks to enormous numerical work, in particular due to A. 
Odlyzko [99, 100], who could determine a few hundred million zeroes 
of the Riemann zeta function around the $10^{20}$-th zero on the line $\mathrm{Re}(s) = 2^{-1}$. 

In somewhat the same direction, there is numerical evidence that the 
eigenvalues distribution of large Wigner matrices also describes the large 
eigenvalues of the Laplacian in some bounded domain such as the cardioid. 
This is related to quantum chaos since these eigenvalues describe the long 
time behavior of the classical ray dynamics in this domain (i.e. the billiard 
dynamics). 

4. Large random matrices and free probability: Free probability is a 
probability theory in a non-commutative framework. Probability measures 
are replaced by tracial states on von Neumann algebras. Free probability 
also contains the central notion of freeness which can be seen as a non- 
commutative analogue of the notion of independence. At the algebraic 
level, it can be related with the usual notion of freeness. This is why free 
probability could be well suited to solve important questions in von Neu- 
mann algebras, such as the question of isomorphism between free group 
factors. Even though this goal is not yet achieved, let us quote a few results 
on von Neumann algebras which were proved thanks to free probability 
machinery [Ell- 
In the 1990's, Voiculescu [121] proved that large random matrices are 
asymptotically free as their size goes to infinity. Hence, large random matrices 
became a source for constructing many non-commutative laws, with 
nice properties with respect to freeness. Thus, free probability can be 
considered as the natural asymptotic large random matrices framework. 
Conversely, if one believes that any tracial state could be approximated 
by the empirical distribution of large matrices (which we shall define more 
precisely later) , which would answer in the affirmative a well known ques- 
tion of A. Connes, then any tracial state could be obtained as such a 
limit. 

In this context, one often studies the asymptotic behavior of traces of 
polynomial functions of several random matrices with size going to infin- 
ity, trying to deduce from this limit either intuition or results concerning 
tracial states. For instance, free probability and large random matrices can 
be used to construct counter examples to some operator algebra questions. 

5. Combinatorics, enumeration of maps and matrix models 

It is well known that the evaluation of the expectation of traces of random 
matrices possesses a combinatorial nature. For instance, if one considers an 
$N \times N$ symmetric or Hermitian matrix $X_N$ with i.i.d. centered entries with 
covariance $N^{-1}$, it is well known that $E[N^{-1}\mathrm{Tr}(X_N^p)]$ converges toward 
0 if $p$ is odd and toward the Catalan number $C_{p/2}$ if $p$ is even. $C_p$ is the 
number of non-crossing pair partitions of $\{1, \dots, 2p\}$ and arises very often in 
combinatorics. This idea was pushed forward by J. Harer and D. Zagier 
[68], who computed exactly moments of the trace of $X_N^p$ to enumerate 
maps with given number of vertices and genus. This combinatorial aspect 
of large random matrices was developed in the free probability context by 
R. Speicher [113]. 

This strategy was considerably generalized by 't Hooft, who saw that matrix 
integrals such as 

$$Z_N(P) = E\left[e^{N\mathrm{Tr}(P(X_N^1, \dots, X_N^n))}\right]$$

with a polynomial function $P$ and independent copies $X_N^i$ of $X_N$, can 
be seen as generating functions for the enumeration of maps of various 
types. The formal proof follows from Feynman diagrams expansion. This 
relation is nicely summarized in an article by A. Zvonkin and we shall 
describe it more precisely in Chapter 5. One-matrix integrals can be used 
to enumerate various maps of arbitrary genus (maps with a given genus $g$ 
appearing as the $N^{-2g}$ correction terms in the expansion of $Z_N(P)$), and 
several-matrix integrals can serve to consider the case where the vertices 
of these maps are colored, i.e. can take different states. For example, two-matrix 
integrals can therefore serve to define an Ising model on random 
graphs. 

Matrix models were also used in physics to construct string theory models. 
Since string theory concerns maps with arbitrary genus, matrix models 
have to be considered at criticality and with temperature parameters well 
tuned with the dimension in order to have any relevance in this domain. 
It seems that this subject had a great revival in the last few years, but it 
seems still far from mathematical (or at least my) understanding. 
Haar distributed unitary matrices can also be used to enumerate combinatorial 
objects due to their relation with representations of the symmetric 
group (cf. [34] for instance). Nice applications to the enumeration of magic 
squares can be found in [38]. 

In this domain, one tries to estimate integrals such as $Z_N(P)$, and in 
particular tries to obtain the full expansion of $\log Z_N(P)$ in terms of the 
dimension $N$. This could be done rigorously so far only for one-matrix 
models by use of Riemann-Hilbert problem techniques by J. McLaughlin 
and N. Ercolani [46]. First order asymptotics for a few several-matrix models 
could be obtained by orthogonal polynomial methods by M. L. Mehta [93], 
[90] and by large deviations techniques in [61]. The physics literature 
on the subject is far more extensive, as can be seen on the arXiv (see 
work by V. Kazakov, I. Kostov, M. Staudacher, B. Eynard, P. Zinn-Justin, 
etc.). 
6. Large random matrices, random partitions and determinantal 
laws 

It is well known [93] that Gaussian matrices have a determinantal form, i.e. 
the law of the eigenvalues $(\lambda_1, \dots, \lambda_N)$ of a Wigner matrix with complex 
Gaussian entries (also called the GUE) is given by 

$$dP(\lambda_1, \dots, \lambda_N) = Z_N^{-1}\,\Delta(\lambda)^2\, e^{-\frac{N}{2}\sum_{i=1}^N \lambda_i^2} \prod_{i=1}^N d\lambda_i$$

with $Z_N$ the normalizing constant and $\Delta$ the Vandermonde determinant 

$$\Delta(\lambda) = \prod_{1 \le i < j \le N} (\lambda_j - \lambda_i) = \det\left(\lambda_i^{j-1}\right)_{1 \le i,j \le N}.$$

Because $\Delta$ is a determinant, specific techniques can be used to study for 
instance the law of the top eigenvalue or the spacing distribution in the 
bulk or next to the top (cf. [117]). Such laws actually appear in different 
contexts such as random partitions, as illustrated in the work of K. 
Johansson [77], or tiling problems [75]. For more general remarks on the 
relation between random matrices and random partitions, see [101]. 
In fact, determinantal laws appear naturally when non-intersecting paths 
are involved. Indeed, following [85], if $k_T$ is the transition probability of a 
homogeneous continuous Markov process, and $P^N_T$ the distribution of $N$ 
independent copies $X_N(t) = (x_1(t), \dots, x_N(t))$ of this process, then for any 
$X = (x_1, \dots, x_N)$, $x_1 < x_2 < \dots < x_N$, $Y = (y_1, \dots, y_N)$, $y_1 < y_2 < \dots < y_N$, 
the reflection principle shows that 

$$P^N_T\big(X_N(0) = X, X_N(T) = Y \,\big|\, \forall t \ge 0,\ x_1(t) < x_2(t) < \dots < x_N(t)\big) = C(x)\det\big((k_T(x_i, y_j))_{1 \le i,j \le N}\big) \qquad (1.0.1)$$

with 

$$C(x)^{-1} = \int \det\big((k_T(x_i, y_j))_{1 \le i,j \le N}\big)\, dy.$$

This might provide an additional motivation to study determinantal laws. 
Even more striking is the occurrence of large Gaussian matrices laws for 
the problem of the longest increasing subsequence [8], directed polymers 
and the totally asymmetric simple exclusion process [75]. These relations 
are based on bijections with pairs of Young tableaux. 
In fact, the law of the hitting time of the totally asymmetric simple ex- 
clusion process (TASEP) starting from Heaviside initial condition can be 
related with the law of the largest eigenvalue of a Wishart matrix. Let us 
remind the reader that the TASEP is a process with values in $\{0, 1\}^{\mathbb{Z}}$, 
0 representing the fact that the site is empty and 1 that it is occupied, 
the dynamics of which are described as follows. Each site of $\mathbb{Z}$ is equipped 
with a clock which rings at times with exponential law. When the clock 
rings at site $i$, nothing happens if there is no particle at $i$ or if there is 
one at $i+1$. Otherwise, the particle jumps from $i$ to $i+1$. Once this clock 
has rung, it is replaced by a brand new independent clock. K. Johansson [75] 
considered these dynamics starting from the initial condition where there 
are no particles on $\mathbb{Z}^+$ but one particle on each site of $\mathbb{Z}^-$. The paths of 
the particles do not intersect by construction and therefore one can expect 
the law of the configurations to be determinantal. The main question 
is to know where the particle which was at site $-N$, $N \in \mathbb{N}$, 
at time zero will be at time $T$. In other words, one wants to study the 
time $H(N, M)$ that the particle which was initially at $-N$ needs to get 
to $M - N$. K. Johansson [75] has shown that $H(N, M)$ has the same 
law as the largest eigenvalue of a Gaussian complex Wishart matrix 
$X^{N+1,M}(X^{N+1,M})^*$, where $X^{N+1,M}$ is an $(N+1) \times M$ matrix with i.i.d. 
complex Gaussian entries with covariance $2^{-1}$. This remark allowed him 
to complete the law of large numbers result of Rost by the study of 
the fluctuations of order $N^{1/3}$. 

This paper opens the field of investigation of diverse growth processes 
(cf. Forrester [53]), to the problem of generalizing this result to different 
initial conditions or to other problems such as tiling models [78]. In this 
last context, one of the main results is the description of the fluctuation of 
the boundary of the tiling in terms of the Airy process (cf. M. Prähofer 
and H. Spohn [114] and K. Johansson [79]). 

In this set of problems, one usually meets the problem of analyzing the 
largest eigenvalue of a large matrix, which is a highly non trivial analysis 
since the eigenvalues interact by a Coulomb gas potential. 

In short, large random matrices became extremely fashionable during the 
last ten years. It is somewhat a pity that there is no good introductory book to 
the field. Having seen the six aspects of the topic I tried to describe above and 
imagining all those I forgot, the task looks like a challenge. 

These notes are devoted to a very particular aspect of the study of large 
random matrices, namely the study of the deviations of the law of macroscopic 
quantities of large random matrices, such as their spectral measures. It is only 
connected to points 4 and 5 listed above. Since large deviations results are 
refinements of law of large numbers theorems, let us briefly summarize these last 
results here. 

It has been known since Wigner that the spectral measure of Wigner matrices 
converges toward the semicircle law almost surely. More precisely, let us consider 
a Wigner matrix, that is an $N \times N$ selfadjoint matrix $X^N$ with independent 
(modulo the symmetry constraint) equidistributed centered entries with covariance 
$N^{-1}$. Let $(\lambda_1, \dots, \lambda_N)$ be the eigenvalues of $X^N$. Then, it was shown by 
Wigner [131], under appropriate assumptions on the moments of the entries, 
that the spectral measure $\hat\mu^N = N^{-1}\sum_{i=1}^N \delta_{\lambda_i}$ converges almost surely toward the 
semi-circle distribution 

$$\sigma(dx) = C\sqrt{4 - x^2}\, 1_{|x| \le 2}\, dx.$$

This result was originally proved by estimating the moments $\{N^{-1}\mathrm{Tr}((X^N)^p), p \in \mathbb{N}\}$, 
which is a common strategy to study the spectral measure of self-adjoint 
random matrices. 
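The moment strategy can be checked numerically. The sketch below is only an illustration under assumptions of ours (a GUE sample built with NumPy, size 400, a fixed seed); it compares the normalized trace moments with 0 for odd $p$ and with the Catalan number $C_{p/2}$ for even $p$, which are the moments of the semi-circle distribution.

```python
import numpy as np
from math import comb


def sample_gue(n, rng):
    """Hermitian matrix whose entries are complex Gaussian with covariance 1/n."""
    g = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
    return (g + g.conj().T) / (2.0 * np.sqrt(n))


def normalized_trace_moments(x, max_p):
    """Return [N^{-1} Tr(x^p) for p = 1..max_p], computed from the eigenvalues."""
    eig = np.linalg.eigvalsh(x)
    return [float(np.mean(eig ** p)) for p in range(1, max_p + 1)]


def catalan(k):
    """k-th Catalan number, i.e. the number of non-crossing pair partitions of 2k points."""
    return comb(2 * k, k) // (k + 1)


rng = np.random.default_rng(0)  # fixed seed, chosen arbitrarily
moments = normalized_trace_moments(sample_gue(400, rng), 6)
for p, m in enumerate(moments, start=1):
    target = catalan(p // 2) if p % 2 == 0 else 0
    print(f"p={p}  empirical={m:.3f}  semicircle={target}")
```

For a single matrix of this size the empirical values already sit close to 0, 1, 0, 2, 0, 5, since the fluctuations of normalized traces are of order $N^{-1}$.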

This convergence can also be proved by considering the Stieltjes transform 
of the spectral measure following Z. Bai [5], which demands fewer hypotheses 
on the moments of the entries of $X^N$. In the case of Gaussian entries, this 
result can be easily deduced from the large deviation principle of Chapter 3. 
The convergence of the spectral measure was generalized to Wishart matrices 
(matrices of the form $X^N R^N (X^N)^*$ with a matrix $X^N$ with independent entries 
and a diagonal matrix $R^N$) by Pastur and Marchenko. Another interesting 
question is to wonder, if you are given two arbitrary large matrices $(A, B)$ with 
given spectrum, how the spectrum of the sum of these two matrices behaves. Of 
course, this depends a lot on their eigenvectors. If one assumes that $A$ and $B$ 
have the same eigenvectors and i.i.d. eigenvalues with law $\mu$ and $\nu$ respectively, 
the law of the eigenvalues of $A + B$ is the standard convolution $\mu * \nu$. On the 
contrary, if the eigenvectors of $A$ and $B$ are a priori not related, it is natural to 
consider $A + UBU^*$ with $U$ following the Haar measure on the unitary group. 
It was proved by D. Voiculescu [122] that the spectral measure of this sum 
converges toward the free convolution $\mu_A \boxplus \mu_B$ if the spectral measure of $A$ 
(resp. $B$) converges toward $\mu_A$ (resp. $\mu_B$) as the size of the matrices goes to 
infinity. More generally, if one considers the normalized trace of a word in two 
independent Wigner matrices then Voiculescu [122] proved that it converges in 
expectation (but actually also almost surely) toward a limit which is described 
by the trace of this word taken at two free semi-circular variables. We shall 
describe the notion of freeness in a later chapter. 
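The difference between the standard and the free convolution can be seen on a small experiment. The following sketch rests on choices of ours that are not in the text: both spectra are symmetric Bernoulli ($\pm 1$ with equal weights), the Haar unitary is sampled by the usual QR construction with phase correction, and the size and seed are arbitrary. With common eigenvectors the fourth moment of the sum is 8, whereas for $A + UBU^*$ it should be close to 6, the fourth moment of the arcsine law on $[-2, 2]$, which is the free convolution of two symmetric Bernoulli laws.

```python
import numpy as np


def haar_unitary(n, rng):
    """Sample from the Haar measure on U(n): QR of a complex Ginibre matrix,
    with the phases of diag(R) absorbed into the columns of Q."""
    z = (rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))) / np.sqrt(2.0)
    q, r = np.linalg.qr(z)
    d = np.diag(r)
    return q * (d / np.abs(d))


def moment(values, p):
    """p-th moment of the empirical measure of `values`."""
    return float(np.mean(values ** p))


rng = np.random.default_rng(1)  # arbitrary seed
n = 400
a = np.repeat([-1.0, 1.0], n // 2)  # spectrum of A: half -1, half +1
b = np.tile([-1.0, 1.0], n // 2)    # spectrum of B, paired evenly with that of A
u = haar_unitary(n, rng)

commuting = a + b                   # A and B diagonal in the same basis
rotated = np.linalg.eigvalsh(np.diag(a) + u @ np.diag(b) @ u.conj().T)

for p in (2, 4):
    print(f"p={p}  same eigenvectors: {moment(commuting, p):.3f}"
          f"  Haar-rotated: {moment(rotated, p):.3f}")
```

Both constructions have second moment 2; only the higher moments distinguish the two notions of convolution.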

The question of the fluctuations of the spectral measure of random matrices 
was initiated in 1982 by D. Jonsson for Wishart matrices by using the 
moments method. This approach was applied and improved by A. Soshnikov 
and Y. Sinai [110], who considered Wigner matrices with non-Gaussian entries 
but sufficient bounds on their moments and who obtained precise estimates 
on the moments $\{N^{-1}\mathrm{Tr}((X^N)^p), p \in \mathbb{N}\}$. Such results were generalized to the 
non-commutative setting where one considers polynomial functions of several 
independent random matrices by T. Cabanal-Duvillard [28] and myself [60]. 
Recently, J. Mingo and R. Speicher [96] gave a combinatorial interpretation of the 
limiting covariance via a notion of second order freeness which places the problem 
of fluctuations in its natural non-commutative framework. They applied 
it with P. Sniady [97] to unitary matrices, generalizing to a non-commutative 
framework the results of P. Diaconis and M. Shahshahani [37] showing that 
traces of moments of unitary matrices converge towards Gaussian variables. In 
[60], I used the non-commutative framework to study fluctuations of the spectral 
measure of Gaussian band matrices, following an idea of D. Shlyakhtenko 
[109]. On the other hand, A. Khorunzhy, B. Khoruzhenko and L. Pastur [88] and 
more recently Z. Bai and J.F. Yao [6] developed Stieltjes transform techniques 
to study central limit theorems for entries with possibly only the first four 
moments bounded. Such techniques apply at best to prove central limit theorems 
for nice analytic functions of the matrix under study. K. Johansson [73] 
considered Gaussian entries in order to take advantage of the fact that in this case the 
eigenvalues have a simple joint law, given by a Coulomb gas type interaction. In 
this case, he could describe the optimal set of functions for which a central limit 
theorem can hold. Note here that in [60], the covariance is described in terms 
of a positive symmetric operator and therefore such an optimal set should be 
described as the domain of this operator. However, because this operator acts 
on non-commutative functions, its domain remains rather mysterious. A general 
combinatorial approach for understanding the fluctuations of band matrices 
with entries satisfying for instance Poincaré inequalities and rather general test 
functions has recently been undertaken by G. Anderson and O. Zeitouni. 

In these notes, we shall study the error to the typical behavior in terms of large 
deviations in the cases listed above, with the restriction to Gaussian entries. 
They rely on a series of papers I have written on this subject with different 
coauthors and try to give a complete accessible 
overview of this work to uninitiated readers. Some statements are improved or 
corrected and global introductions to free probability and hydrodynamics/large 
deviations techniques are given. While full proofs are given in Chapter 3 and 
rather detailed in Chapter 4, Chapter 7 only outlines how to adapt the ideas 
of Chapter 4 to the non-commutative setting. Chapter 5 uses the results of 
Chapter 1 and Chapter 4 to study matrix models. These notes are supposed 
to be accessible to non probabilists, if they assume some facts concerning Ito's 
calculus. 

First, we shall consider the case of Wigner Gaussian matrices (see Chapter 3). 
The case of non-Gaussian entries is still an open problem. We generalize our 
approach to non-centered Gaussian entries in Chapter 4, which corresponds to 
the deviations of the law of the spectral measure of A + X with a deterministic 
diagonal matrix A and a Wigner matrix X. This result in turn gives the first 
order asymptotics of spherical integrals. The asymptotics of spherical integrals 
allows us to estimate matrix integrals in the case of quadratic (also called AB) 
interaction. Such a study puts on a firm ground some physics papers of Matytsin 
for instance. It is related with the enumeration of colored planar maps. We 
finally present the natural generalization of these results to several matrices, 
which deals with the so-called free micro-states entropy. 
For $N \in \mathbb{N}$, $\mathcal{M}_N$ will denote the set of $N \times N$ matrices with complex entries, 
$\mathcal{H}_N$ (resp. $\mathcal{S}_N$) will denote the set of $N \times N$ Hermitian (resp. symmetric) matrices. 
$U(N)$ (resp. $O(N)$, resp. $S(N)$) will denote the unitary (resp. orthogonal, 
resp. symplectic) group. We denote by $\mathrm{Tr}$ the trace on $\mathcal{M}_N$, $\mathrm{Tr}(A) = \sum_{i=1}^N A_{ii}$, and by 
$\mathrm{tr}$ the normalized trace $\mathrm{tr}(A) = N^{-1}\mathrm{Tr}(A)$. 

To denote an ordered product of non-commutative variables $X_1, \dots, X_n$ (such 
as matrices), we write in short 

$$X_1 X_2 \cdots X_n = \prod_{1 \le i \le n} X_i.$$

$\mathbb{C}[X_1, \dots, X_n]$ (resp. $\mathbb{C}\langle X_1, \dots, X_n \rangle$) denotes the space of commutative (resp. 
non-commutative) polynomials in $n$ variables for which 
$\prod_{1 \le i \le n} X_{j_i} = \prod_{1 \le i \le n} X_{j_{\sigma(i)}}$ (resp. $\prod_{1 \le i \le n} X_{j_i} \ne \prod_{1 \le i \le n} X_{j_{\sigma(i)}}$) for all choices 
of indices $\{j_i, 1 \le i \le n, n \in \mathbb{N}\}$ (resp. possibly for a choice of $\{j_i, 1 \le i \le n, 
n \in \mathbb{N}\}$) and for all permutations $\sigma$ (resp. possibly for some permutation $\sigma$). 

For a Polish space $X$, $\mathcal{P}(X)$ shall denote the set of probability measures on $X$. 
$\mathcal{P}(X)$ will be equipped with the usual weak topology, i.e. a sequence $\mu_n \in \mathcal{P}(X)$ 
converges toward $\mu$ iff for any bounded continuous function $f$ on $X$, $\mu_n(f)$ 
converges toward $\mu(f)$. Here, we denote in short 

$$\mu(f) = \int f(x)\, d\mu(x).$$

For two Polish spaces $X, Y$ and a measurable function $\phi : X \to Y$, for any 
$\mu \in \mathcal{P}(X)$ we denote by $\phi_\#\mu \in \mathcal{P}(Y)$ the push forward of $\mu$ by $\phi$, that is the 
probability measure on $Y$ such that for any bounded continuous $f : Y \to \mathbb{R}$, 

$$\phi_\#\mu(f) = \int f(\phi(x))\, d\mu(x).$$

For a given selfadjoint $N \times N$ matrix $A$, we denote by $(\lambda_1(A), \dots, \lambda_N(A))$ its 
$N$ (real) eigenvalues and by $\hat\mu^N_A$ its spectral measure 

$$\hat\mu^N_A = \frac{1}{N}\sum_{i=1}^N \delta_{\lambda_i(A)}.$$

For two Polish spaces $X, Y$ we denote by $C^0_b(X, Y)$ (or $C(X, Y)$ when no 
ambiguity is possible) the space of bounded continuous functions from $X$ to $Y$. 
For instance, we shall denote by $C([0, 1], \mathcal{P}(\mathbb{R}))$ the set of continuous processes on 
$[0, 1]$ with values in the set $\mathcal{P}(\mathbb{R})$ of probability measures on $\mathbb{R}$, endowed with its 
usual weak topology. For a measurable set $\Omega$ of $\mathbb{R} \times [0, 1]$, $C^{p,q}_b(\Omega)$ denotes the 
set of real-valued functions on $\Omega$ which are $p$ times continuously differentiable 
with respect to the (first) space variable and $q$ times continuously differentiable 
with respect to the (second) time variable with bounded derivatives. $C^{p,q}_c(\Omega)$ 
will denote the functions of $C^{p,q}_b(\Omega)$ with compact support in the interior of the 
measurable set $\Omega$. For a probability measure $\mu$ on a Polish space $X$, $L^p(d\mu)$ 
denotes the space of measurable functions with finite $p$-th moment under $\mu$. We 
shall say that an equality holds in the sense of distribution on a measurable set 
$\Omega$ if it holds once integrated with respect to any $C^{\infty,\infty}_c(\Omega)$ function. 



Chapter 2 

Basic notions of large 
deviations 

Since these notes are devoted to the proof of large deviations principles, let us 
remind the reader what is a large deviation principle and the few main ideas 
which are commonly used to prove it. We refer the reader to [41] and [43] 
for further developments. In what follows, X will be a Polish space (that is a 
complete separable metric space). We then have 

Definition 2.1. • $I : X \to \mathbb{R}^+ \cup \{+\infty\}$ is a rate function iff it is lower 
semi-continuous, i.e. its level sets $\{x \in X : I(x) \le M\}$ are closed for any 
$M \ge 0$. It is a good rate function if its level sets $\{x \in X : I(x) \le M\}$ are 
compact for any $M \ge 0$. 
• A sequence $(\mu_N)_{N \in \mathbb{N}}$ of probability measures on $X$ satisfies a large deviation 
principle with speed (or in the scale) $a_N$ (going to infinity with $N$) 
and rate function $I$ iff 

a) For any closed subset $F$ of $X$, 

$$\limsup_{N \to \infty} \frac{1}{a_N} \log \mu_N(F) \le -\inf_F I.$$

b) For any open subset $O$ of $X$, 

$$\liminf_{N \to \infty} \frac{1}{a_N} \log \mu_N(O) \ge -\inf_O I.$$
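To make the definition concrete, here is a minimal numerical sketch in a setting of our own choosing, not taken from these notes: $\mu_N$ is the law of the empirical mean of $N$ fair coin tosses with values in $\{0, 1\}$, the speed is $a_N = N$, and the rate function is the classical Cramér one, $I(x) = x\log(2x) + (1-x)\log(2(1-x))$. For the closed set $F = [0.7, 1]$ the infimum of $I$ over $F$ is $I(0.7)$, and $-N^{-1}\log\mu_N(F)$ should approach this value.

```python
from math import ceil, exp, lgamma, log


def log_binomial_pmf(n, k):
    """log P(S_n = k) for S_n ~ Binomial(n, 1/2)."""
    return lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1) - n * log(2.0)


def log_upper_tail(n, x):
    """log P(S_n / n >= x), summed in log-space to avoid underflow."""
    k_min = ceil(x * n - 1e-9)  # small slack against floating point rounding
    logs = [log_binomial_pmf(n, k) for k in range(k_min, n + 1)]
    top = max(logs)
    return top + log(sum(exp(v - top) for v in logs))


def rate(x):
    """Cramer rate function of a fair coin, valid for 0 < x < 1."""
    return x * log(2.0 * x) + (1.0 - x) * log(2.0 * (1.0 - x))


x = 0.7
for n in (50, 500, 5000):
    print(f"N={n:5d}  -log(mu_N(F))/N = {-log_upper_tail(n, x) / n:.5f}"
          f"  inf_F I = {rate(x):.5f}")
```

The gap between the two columns shrinks like $\log N / N$, which is invisible at the exponential scale the definition is concerned with.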

The proof of a large deviation principle often proceeds first by the proof of a 
weak large deviation principle (which is defined as in Definition 2.1 except that 
the upper bound is only required to hold for compact sets) and the so-called 
exponential tightness property: 

Definition 2.2. A sequence $(\mu_N)_{N \in \mathbb{N}}$ of probability measures on $X$ is exponentially 
tight iff there exists a sequence $(K_L)_{L \in \mathbb{N}}$ of compact sets such that 

$$\limsup_{L \to \infty} \limsup_{N \to \infty} \frac{1}{a_N} \log \mu_N(K_L^c) = -\infty.$$
A weak large deviation principle is itself equivalent to the estimation of the 
probability of deviations towards small balls: 

Theorem 2.3 ([41, Theorem 4.1.11]). Let $\mathcal{A}$ be a base of the topology of $X$. 
For every $A \in \mathcal{A}$, define 

$$L_A = -\liminf_{N \to \infty} \frac{1}{a_N} \log \mu_N(A)$$

and 

$$I(x) = \sup_{A \in \mathcal{A} : x \in A} L_A.$$

Suppose that for all $x \in X$, 

$$I(x) = \sup_{A \in \mathcal{A} : x \in A} \left\{ -\limsup_{N \to \infty} \frac{1}{a_N} \log \mu_N(A) \right\}.$$

Then, $\mu_N$ satisfies a weak large deviation principle with rate function $I$. 

As an immediate corollary, we find that if $d$ is a distance on $X$ compatible 
with the weak topology and $B(x, \delta) = \{y \in X : d(y, x) < \delta\}$, 

Corollary 2.4. Assume that for all $x \in X$ 

$$-I(x) := \limsup_{\delta \to 0} \limsup_{N \to \infty} \frac{1}{a_N} \log \mu_N(B(x, \delta)) = \liminf_{\delta \to 0} \liminf_{N \to \infty} \frac{1}{a_N} \log \mu_N(B(x, \delta)).$$

Then, $\mu_N$ satisfies a weak large deviation principle with rate function $I$. 

From a given large deviation principle one can deduce a large deviation principle 
for other sequences of probability measures by using either the so-called 
contraction principle or Laplace's method. Namely, let us recall the contraction 
principle (cf. Theorem 4.2.1 in [41]): 

Theorem 2.5. Assume that $(\mu_N)_{N \in \mathbb{N}}$ satisfies a large deviation principle with 
good rate function $I$ with speed $a_N$. Then for any function $F : X \to Y$ with values 
in a Polish space $Y$ which is continuous, the image $(F_\#\mu_N)_{N \in \mathbb{N}} \in \mathcal{P}(Y)^{\mathbb{N}}$ 
also satisfies a large deviation principle with the same speed and good rate function 
given for any $y \in Y$ by 

$$J(y) = \inf\{I(x) : F(x) = y\}.$$

Laplace's method (or Varadhan's Lemma) says the following (cf. Theorem 
4.3.1 in [41]): 

Theorem 2.6. Assume that $(\mu_N)_{N \in \mathbb{N}}$ satisfies a large deviation principle with 
good rate function $I$. Let $F : X \to \mathbb{R}$ be a bounded continuous function. Then, 

$$\lim_{N \to \infty} \frac{1}{a_N} \log \int e^{a_N F(x)}\, d\mu_N(x) = \sup_{x \in X} \{F(x) - I(x)\}.$$

Moreover, the sequence 

$$\nu_N(dx) = \frac{1}{\int e^{a_N F(y)}\, d\mu_N(y)}\, e^{a_N F(x)}\, d\mu_N(x) \in \mathcal{P}(X)$$

satisfies a large deviation principle with good rate function 

$$J(x) = I(x) - F(x) + \sup_{y \in X} \{F(y) - I(y)\}.$$
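Varadhan's Lemma can be tested numerically on a toy case. The sketch below again assumes, as an illustration of ours, that $\mu_N$ is the law of the empirical mean of $N$ fair coin tosses ($a_N = N$, Cramér rate function $I$), and takes the arbitrary test function $F(x) = 2x^2$ on $[0, 1]$. It compares $N^{-1}\log\int e^{NF}d\mu_N$ with $\sup\{F - I\}$ evaluated on a grid, and checks that the rate function of the tilted measures, $J = I - F + \sup\{F - I\}$, is non-negative and vanishes at its minimizer.

```python
from math import exp, lgamma, log


def rate(x):
    """Cramer rate function of a fair coin on [0, 1], with 0 * log(0) = 0."""
    total = 0.0
    for p in (x, 1.0 - x):
        if p > 0.0:
            total += p * log(2.0 * p)
    return total


def f(x):
    """Arbitrary bounded continuous test function on [0, 1]."""
    return 2.0 * x * x


def scaled_log_integral(n):
    """(1/n) * log E[exp(n * f(S_n / n))] for S_n ~ Binomial(n, 1/2)."""
    logs = []
    for k in range(n + 1):
        log_pmf = lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1) - n * log(2.0)
        logs.append(log_pmf + n * f(k / n))
    top = max(logs)
    return (top + log(sum(exp(v - top) for v in logs))) / n


grid = [i / 10000 for i in range(10001)]
variational = max(f(x) - rate(x) for x in grid)             # sup {F - I} on the grid
tilted_rate = [rate(x) - f(x) + variational for x in grid]  # J on the grid

for n in (20, 200, 2000):
    print(f"N={n:4d}  scaled log-integral = {scaled_log_integral(n):.5f}"
          f"  sup(F - I) = {variational:.5f}")
print("min of J on the grid:", min(tilted_rate))
```

The last line is a quick sanity check on the sign convention: a rate function must have infimum zero, which holds for $I - F + \sup\{F - I\}$.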



Bryc's theorem (@J, Section 4.4) gives an inverse statement to Laplace's theorem. Namely, assume that we know that for any bounded continuous function $F : X \to \mathbb{R}$, there exists

$$\Lambda(F) = \lim_{N\to\infty} \frac{1}{a_N}\log\int e^{a_N F(x)}\,d\mu_N(x). \qquad (2.0.1)$$

Then, Bryc's theorem says that $\mu_N$ satisfies a weak large deviation principle with rate function

$$I(x) = \sup_{F\in C_b^0(X,\mathbb{R})}\{F(x) - \Lambda(F)\}. \qquad (2.0.2)$$

This actually provides another approach to proving large deviation principles: we see that we need to compute the asymptotics (2.0.1) for as many bounded continuous functions as possible. In general this can easily be done only for some family of functions (for instance, if $\mu_N$ is the law of $N^{-1}\sum_{i=1}^N x_i$ for independent equidistributed bounded random variables $x_i$, with $a_N = N$, such quantities are easy to compute for linear functions $F$). This will always give a weak large deviation upper bound with rate function given as in (2.0.2) but where the supremum is only taken over this family of functions. The point is then to show that in fact this family is sufficient, in particular that this restricted supremum is equal to the supremum over all bounded continuous functions.
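To make the remark on linear functions concrete, here is a small sketch, assuming the $x_i$ are Bernoulli variables with parameter $p$ (an illustrative choice, not taken from the text). For $F(x) = \theta x$ and $a_N = N$, independence gives $\Lambda(F) = \log E[e^{\theta x_1}]$ for every $N$, and the supremum restricted to linear functions can be compared with the relative entropy, which is the classical Cramér rate function in this case. All function names are hypothetical.

```python
import numpy as np

def log_moment_generating(theta, p=0.5):
    """Lambda(F) for the linear function F(x) = theta * x and Bernoulli(p) variables."""
    return np.log(1.0 - p + p * np.exp(theta))

def rate_from_linear_functions(x, thetas, p=0.5):
    """Supremum of F(x) - Lambda(F) restricted to F(x) = theta * x, theta in a grid."""
    return np.max(thetas * x - log_moment_generating(thetas, p))

def relative_entropy(x, p=0.5):
    """Relative entropy of Bernoulli(x) with respect to Bernoulli(p), for 0 < x < 1."""
    return x * np.log(x / p) + (1.0 - x) * np.log((1.0 - x) / (1.0 - p))

thetas = np.linspace(-20.0, 20.0, 400001)
restricted_sup = rate_from_linear_functions(0.7, thetas)
cramer_rate = relative_entropy(0.7)
```

In this example `restricted_sup` and `cramer_rate` agree up to the resolution of the grid in $\theta$, which illustrates a case where the restricted family is sufficient.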



Chapter 3 

Large deviations for the 
spectral measure of large 
random matrices 



3.1. Large deviations for the spectral measure of Wigner Gaussian 
matrices 

Let $X^{N,\beta} = (X^{N,\beta}_{ij})$ be $N \times N$ real (resp. complex) Gaussian Wigner matrices when $\beta = 1$ (resp. $\beta = 2$, resp. $\beta = 4$) defined as follows. They are $N \times N$ self-adjoint random matrices with entries

$$X^{N,\beta}_{kl} = \frac{\sum_{i=1}^{\beta} g^i_{kl}\, e^i_\beta}{\sqrt{\beta N}}, \quad 1 \le k < l \le N, \qquad X^{N,\beta}_{kk} = \sqrt{\frac{2}{\beta N}}\, g_{kk}\, e^1_\beta, \quad 1 \le k \le N,$$

where $(e^i_\beta)_{1\le i\le\beta}$ is a basis of $\mathbb{R}^\beta$, that is $e^1_1 = 1$, $e^1_2 = 1$, $e^2_2 = i$. This definition can be extended to the case $\beta = 4$ when $N$ is even by choosing $X^{N,4} = (X^{N,4}_{ij})_{1\le i,j\le N/2}$ with $X^{N,4}_{kl}$ a $2\times 2$ matrix defined as above but with $(e^k_4)_{1\le k\le 4}$ the Pauli matrices

$$e^1_4 = \begin{pmatrix} 1 & 0\\ 0 & 1\end{pmatrix},\quad e^2_4 = \begin{pmatrix} 0 & -1\\ 1 & 0\end{pmatrix},\quad e^3_4 = \begin{pmatrix} 0 & -i\\ -i & 0\end{pmatrix},\quad e^4_4 = \begin{pmatrix} i & 0\\ 0 & -i\end{pmatrix}.$$

$(g^i_{kl}, k \le l, 1 \le i \le \beta)$ are independent equidistributed centered Gaussian variables with variance 1. $(X^{N,2}, N \in \mathbb{N})$ is commonly referred to as the Gaussian Unitary Ensemble (GUE), $(X^{N,1}, N \in \mathbb{N})$ as the Gaussian Orthogonal Ensemble (GOE) and $(X^{N,4}, N \in \mathbb{N})$ as the Gaussian Symplectic Ensemble (GSE) since they can be characterized by the fact that their laws are invariant under the action of the unitary, orthogonal and symplectic groups respectively (see

USD- 
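$X^{N,\beta}$ has $N$ real eigenvalues $(\lambda_1, \lambda_2, \cdots, \lambda_N)$. Moreover, by invariance of the distribution of $X^{N,1}$ (resp. $X^{N,2}$, resp. $X^{N,4}$) under the action of the orthogonal group $O(N)$ (resp. the unitary group $U(N)$, resp. the symplectic group $S(N)$), it is not hard to check that its eigenvectors will follow the Haar measure $m^\beta_N$ on $O(N)$ (resp. $U(N)$, resp. $S(N)$) in the case $\beta = 1$ (resp. $\beta = 2$, resp. $\beta = 4$). More precisely, a change of variable shows that for any Borel subset $A \subset M_{N\times N}(\mathbb{R})$ (resp. $M_{N\times N}(\mathbb{C})$),

$$P(X^{N,\beta} \in A) = \int 1_{U D(\lambda) U^* \in A}\, dm^\beta_N(U)\, dQ^N_\beta(\lambda) \qquad (3.1.1)$$

with $D(\lambda) = \mathrm{diag}(\lambda_1, \lambda_2, \cdots, \lambda_N)$ the diagonal matrix with entries $(\lambda_1 \le \lambda_2 \le \cdots \le \lambda_N)$ and $Q^N_\beta$ the joint law of the eigenvalues given by

$$Q^N_\beta(d\lambda_1, \cdots, d\lambda_N) = \frac{1}{Z^N_\beta}\,|\Delta(\lambda)|^\beta \exp\Big\{-\frac{\beta}{4} N \sum_{i=1}^N \lambda_i^2\Big\} \prod_{i=1}^N d\lambda_i$$

where $\Delta$ is the Vandermonde determinant $\Delta(\lambda) = \prod_{1\le i<j\le N}(\lambda_i - \lambda_j)$ and $Z^N_\beta$ is the normalizing constant

$$Z^N_\beta = \int \prod_{1\le i<j\le N} |\lambda_i - \lambda_j|^\beta \exp\Big\{-\frac{\beta}{4} N \sum_{i=1}^N \lambda_i^2\Big\} \prod_{i=1}^N d\lambda_i.$$

Such changes of variables are explained in detail in the book in preparation of P. Forrester [53].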

Using this representation, it was proved in US| that the law of the spectral measure $\hat\mu^N = \frac{1}{N}\sum_{i=1}^N \delta_{\lambda_i}$, as a probability measure on $\mathbb{R}$, satisfies a large deviation principle.

In the following, we will denote by $\mathcal{P}(\mathbb{R})$ the space of probability measures on $\mathbb{R}$, and will endow $\mathcal{P}(\mathbb{R})$ with its usual weak topology. We can now state the main result of US].

Theorem 3.1.

1) Let $I_\beta(\mu) = \frac{\beta}{4}\int x^2\, d\mu(x) - \frac{\beta}{2}\Sigma(\mu) - \frac{3}{8}\beta$ with $\Sigma$ the non-commutative entropy

$$\Sigma(\mu) = \int\int \log|x - y|\, d\mu(x)\, d\mu(y).$$

Then:

a. $I_\beta$ is well defined on $\mathcal{P}(\mathbb{R})$ and takes its values in $[0, +\infty]$.

b. $I_\beta(\mu)$ is infinite as soon as $\mu$ satisfies one of the following conditions:

b.1: $\int x^2\, d\mu(x) = +\infty$.

b.2: There exists a subset $A$ of $\mathbb{R}$ of positive $\mu$ mass but null logarithmic capacity, i.e. a set $A$ such that

$$\mu(A) > 0, \qquad \gamma(A) := \exp\Big\{-\inf_{\nu\in M_1^+(A)} \int\int \log\frac{1}{|x-y|}\, d\nu(x)\, d\nu(y)\Big\} = 0.$$

c. $I_\beta$ is a good rate function, i.e. $\{I_\beta \le M\}$ is a compact subset of $\mathcal{P}(\mathbb{R})$ for $M \ge 0$.

d. $I_\beta$ is a convex function on $\mathcal{P}(\mathbb{R})$.

e. $I_\beta$ achieves its minimum value at a unique probability measure on $\mathbb{R}$ which is described as Wigner's semicircular law $\sigma_\beta = (2\pi)^{-1}\sqrt{4 - x^2}\, dx$.

2) The law of the spectral measure $\hat\mu^N = \frac{1}{N}\sum_{i=1}^N \delta_{\lambda_i}$ on $\mathcal{P}(\mathbb{R})$ satisfies a full large deviation principle with good rate function $I_\beta$ in the scale $N^2$.
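The minimizer in point 1.e can be observed numerically. The sketch below samples one Gaussian Wigner matrix for $\beta = 1$ or $\beta = 2$, with the normalization assumed here to be off-diagonal entries of variance $1/N$ and diagonal entries of variance $2/(\beta N)$, and compares its empirical spectral measure with the semicircular law on $[-2, 2]$. The function names, the matrix size and the seed are illustrative assumptions.

```python
import numpy as np

def sample_gaussian_wigner(N, beta, rng):
    """Sample a GOE (beta = 1) or GUE (beta = 2) matrix with E|X_kl|^2 = 1/N off the diagonal."""
    if beta == 1:
        G = rng.standard_normal((N, N))
        upper = np.triu(G, 1) / np.sqrt(N)
        diag = np.sqrt(2.0 / N) * rng.standard_normal(N)
        return upper + upper.T + np.diag(diag)
    if beta == 2:
        G = rng.standard_normal((N, N)) + 1j * rng.standard_normal((N, N))
        upper = np.triu(G, 1) / np.sqrt(2.0 * N)
        diag = rng.standard_normal(N) / np.sqrt(N)
        return upper + upper.conj().T + np.diag(diag)
    raise ValueError("this sketch only covers beta = 1 and beta = 2")

def semicircle_mass(a, b):
    """Mass given to [a, b] (inside [-2, 2]) by the law (2 pi)^{-1} sqrt(4 - x^2) dx."""
    cdf = lambda x: 0.5 + x * np.sqrt(4.0 - x**2) / (4.0 * np.pi) + np.arcsin(x / 2.0) / np.pi
    return cdf(b) - cdf(a)

rng = np.random.default_rng(0)
eigenvalues = np.linalg.eigvalsh(sample_gaussian_wigner(400, 2, rng))

second_moment = np.mean(eigenvalues**2)               # close to 1 for the semicircular law
empirical_mass = np.mean(np.abs(eigenvalues) <= 1.0)  # empirical mass of [-1, 1]
limit_mass = semicircle_mass(-1.0, 1.0)
```

Already for $N = 400$ the second moment, the mass of $[-1,1]$ and the extreme eigenvalues are close to their semicircular values, in agreement with the fact that deviations from $\sigma_\beta$ have probability of order $e^{-cN^2}$.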

Proof: We here skip the proof of 1.b.2 and 1.d and refer to ^U] for these two points. The proof of the large deviation principle is rather clear; one writes the density $Q^N_\beta$ of the eigenvalues as

$$Q^N_\beta(d\lambda_1, \cdots, d\lambda_N) = \frac{1}{Z^N_\beta}\exp\Big\{-\frac{\beta}{2}N^2 \int\int_{x\neq y} f(x,y)\, d\hat\mu^N(x)\, d\hat\mu^N(y)\Big\} \prod_{i=1}^N e^{-\frac{\beta}{4}\lambda_i^2}\, d\lambda_i,$$

with

$$f(x,y) = \log|x - y|^{-1} + \frac{1}{4}(x^2 + y^2).$$

If the function $(x, y) \to 1_{x\neq y} f(x,y)$ were bounded continuous, the large deviation principle would result directly from a standard Laplace method (cf. Theorem 2.6), where the entropic term coming from the underlying Lebesgue measure could be neglected since the scale of the large deviation principle we are considering is $N^2 \gg N$. In fact, the main point is to deal with the fact that the logarithmic function blows up at the origin and to control what happens at the diagonal $\Delta := \{x = y\}$. In the sequel, we let $\bar Q^N_\beta$ be the non-normalized positive measure $\bar Q^N_\beta = Z^N_\beta Q^N_\beta$ and prove upper and lower large deviation estimates with rate function

$$J_\beta(\mu) = \frac{\beta}{2}\int\int f(x,y)\, d\mu(x)\, d\mu(y).$$

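This is of course enough to prove the second point of the theorem by taking $F = O = \mathcal{P}(\mathbb{R})$ to obtain

$$\lim_{N\to\infty}\frac{1}{N^2}\log Z^N_\beta = -\frac{\beta}{2}\inf_{\nu\in\mathcal{P}(\mathbb{R})}\int\int f(x,y)\, d\nu(x)\, d\nu(y).$$

To obtain that this limit is equal to $-\frac{3}{8}\beta$, one needs to show that the infimum is taken at the semi-circle law and then compute its value. Alternatively, Selberg's formula allows one to compute $Z^N_\beta$ explicitly, from which its asymptotics are easy to get. We refer to JUj for this point. The upper bound is obtained as follows. Noticing that $\hat\mu^N \otimes \hat\mu^N(\Delta) = N^{-1}$ $Q^N_\beta$-almost surely (since the eigenvalues are almost surely distinct), we see that for any $M \in \mathbb{R}^+$, $Q^N_\beta$ a.s.,

$$\int\int 1_{x\neq y} f(x,y)\, d\hat\mu^N(x)\, d\hat\mu^N(y) \ge \int\int 1_{x\neq y}\, f(x,y)\wedge M\, d\hat\mu^N(x)\, d\hat\mu^N(y) = \int\int f(x,y)\wedge M\, d\hat\mu^N(x)\, d\hat\mu^N(y) - \frac{M}{N}.$$

Therefore, for any Borel subset $A \subset \mathcal{P}(\mathbb{R})$ and any $M \in \mathbb{R}^+$,

$$\bar Q^N_\beta\Big(\frac{1}{N}\sum_{i=1}^N \delta_{\lambda_i} \in A\Big) \le \Big(\int e^{-\frac{\beta}{4}\lambda^2}\, d\lambda\Big)^N \exp\Big\{-\frac{\beta}{2}N^2 \inf_{\mu\in A}\Big\{\int\int f(x,y)\wedge M\, d\mu(x)\, d\mu(y)\Big\} + \frac{\beta}{2} N M\Big\} \qquad (3.1.2)$$

resulting with

$$\limsup_{N\to\infty}\frac{1}{N^2}\log \bar Q^N_\beta\Big(\frac{1}{N}\sum_{i=1}^N \delta_{\lambda_i} \in A\Big) \le -\frac{\beta}{2}\inf_{\mu\in A}\Big\{\int\int f(x,y)\wedge M\, d\mu(x)\, d\mu(y)\Big\}.$$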

We now show that if A is closed, M can be taken equal to infinity in the above 
right hand side. 

We first observe that $I_\beta(\mu) = \frac{\beta}{2}\int\int f(x,y)\, d\mu(x)\, d\mu(y) - \frac{3}{8}\beta$ is a good rate function. Indeed, since $f$ is the supremum of bounded continuous functions, $I_\beta$ is lower semi-continuous, i.e. its level sets are closed. Moreover, because $f$ blows up when $x$ or $y$ go to infinity, its level sets are compact. Indeed, if $m = -\inf f$,

$$\Big[\inf_{x,y\in[-A,A]^c}(f+m)(x,y)\Big]\, \mu([-A,A]^c)^2 \le \int\int (f+m)(x,y)\, d\mu(x)\, d\mu(y) = \frac{2}{\beta} I_\beta(\mu) + \frac{3}{4} + m$$

resulting with

$$\{I_\beta \le M\} \subset K_M$$

where $K_M$ is the set

$$K_M = \bigcap_{A>0}\Big\{\mu \in \mathcal{P}(\mathbb{R});\ \mu([-A,A]^c) \le \sqrt{\frac{2\beta^{-1}M + m + \frac{3}{4}}{\inf_{x,y\in[-A,A]^c}(f+m)(x,y)}}\Big\}.$$

$K_M$ is compact since $\inf_{x,y\in[-A,A]^c}(f+m)(x,y)$ goes to infinity with $A$. Hence, $\{I_\beta \le M\}$ is compact, i.e. $I_\beta$ is a good rate function.

Moreover, (3.1.2) and the above observations show also that

$$\limsup_{M\to\infty}\limsup_{N\to\infty}\frac{1}{N^2}\log \bar Q^N_\beta\Big(\frac{1}{N}\sum_{i=1}^N \delta_{\lambda_i} \in K_M^c\Big) = -\infty$$

insuring, with the uniform boundedness of $N^{-2}\log Z^N_\beta$, the exponential tightness of $Q^N_\beta$. Hence, we may assume that $A$ is compact, and actually a ball surrounding any given probability measure with arbitrarily small radius (see Chapter 2). Let $B(\mu, \delta)$ be a ball centered at $\mu \in \mathcal{P}(\mathbb{R})$ with radius $\delta$ for a distance compatible with the weak topology.
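Since $\mu \to \int\int f(x,y)\wedge M\, d\mu(x)\, d\mu(y)$ is continuous on $\mathcal{P}(\mathbb{R})$ for any $M \in \mathbb{R}^+$, (3.1.2) shows that for any probability measure $\mu \in \mathcal{P}(\mathbb{R})$

$$\limsup_{\delta\to 0}\limsup_{N\to\infty}\frac{1}{N^2}\log \bar Q^N_\beta\Big(\frac{1}{N}\sum_{i=1}^N \delta_{\lambda_i} \in B(\mu,\delta)\Big) \le -\frac{\beta}{2}\int\int f(x,y)\wedge M\, d\mu(x)\, d\mu(y). \qquad (3.1.3)$$

We can finally let $M$ go to infinity and use the monotone convergence theorem, which asserts that

$$\lim_{M\to\infty}\int\int f(x,y)\wedge M\, d\mu(x)\, d\mu(y) = \int\int f(x,y)\, d\mu(x)\, d\mu(y),$$

to conclude that

$$\limsup_{\delta\to 0}\limsup_{N\to\infty}\frac{1}{N^2}\log \bar Q^N_\beta\Big(\frac{1}{N}\sum_{i=1}^N \delta_{\lambda_i} \in B(\mu,\delta)\Big) \le -\frac{\beta}{2}\int\int f(x,y)\, d\mu(x)\, d\mu(y),$$

finishing the proof of the upper bound.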

To prove the lower bound, we can also proceed quite roughly by constraining the eigenvalues to belong to very small sets. This will not affect the lower bound, again because of the fast speed $N^2 \gg N\log N$ of the large deviation principle we are proving. Again, the difficulty lies in the singularity of the logarithm. The proof goes as follows; let $\nu \in \mathcal{P}(\mathbb{R})$. Since $I_\beta(\nu) = +\infty$ if $\nu$ has an atom, we can assume without loss of generality that it does not when proving the lower bound. We construct a discrete approximation to $\nu$ by setting

$$x^{1,N} = \inf\Big\{x \,\Big|\, \nu(]-\infty, x]) \ge \frac{1}{N+1}\Big\},$$

$$x^{i+1,N} = \inf\Big\{x \ge x^{i,N} \,\Big|\, \nu(]x^{i,N}, x]) \ge \frac{1}{N+1}\Big\}, \qquad 1 \le i \le N-1,$$

and $\nu^N = \frac{1}{N}\sum_{i=1}^N \delta_{x^{i,N}}$ (note here that the choice of the length $(N+1)^{-1}$ of the intervals rather than $N^{-1}$ is only done to insure that $x^{N,N}$ is finite). Then, $\nu^N$ converges toward $\nu$ as $N$ goes to infinity. Thus, for any $\delta > 0$, for $N$ large enough, if we set $\Delta_N := \{\lambda_1 \le \lambda_2 \le \cdots \le \lambda_N\}$,



Q$(p. N G > ^Q^ma* |A, < -} R A,y)) (3.1.4) 

AT" JV -/V 

= / TTlx^-^ + A^A.^expl-^V^ + A,) 2 }!!^ 

i{|A 1 |<|}nA„ - 



i=l i=l 
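The discrete approximation by quantiles can be tried out numerically. The following sketch, written under the assumption that $\nu$ is the semicircular law $(2\pi)^{-1}\sqrt{4-x^2}\,dx$ (for which $\int x^2 d\nu = 1$ and $\Sigma(\nu) = -1/4$), builds the points $x^{i,N}$ as quantiles of level $i/(N+1)$ and evaluates the discretized functional $\frac{\beta}{4}\int x^2 d\nu^N - \frac{\beta}{2}\frac{1}{N^2}\sum_{i\neq j}\log|x^{i,N}-x^{j,N}| - \frac{3}{8}\beta$. The helper names and the interpolation-based inversion of the distribution function are illustrative choices, not the construction used in the proof.

```python
import numpy as np

def semicircle_cdf(x):
    """Distribution function of the law (2 pi)^{-1} sqrt(4 - x^2) dx on [-2, 2]."""
    x = np.clip(x, -2.0, 2.0)
    return 0.5 + x * np.sqrt(4.0 - x**2) / (4.0 * np.pi) + np.arcsin(x / 2.0) / np.pi

def quantile_points(N, grid_size=400001):
    """Points x^{i,N}, 1 <= i <= N, at quantile levels i / (N + 1)."""
    grid = np.linspace(-2.0, 2.0, grid_size)
    levels = np.arange(1, N + 1) / (N + 1.0)
    return np.interp(levels, semicircle_cdf(grid), grid)   # numerical inverse of the CDF

def discrete_rate(points, beta):
    """Discretized rate: beta/4 * second moment - beta/2 * off-diagonal log energy - 3 beta / 8."""
    N = len(points)
    gaps = np.abs(points[:, None] - points[None, :])
    np.fill_diagonal(gaps, 1.0)            # log(1) = 0, so the diagonal terms are dropped
    log_energy = np.sum(np.log(gaps)) / N**2
    second_moment = np.mean(points**2)
    return beta / 4.0 * second_moment - beta / 2.0 * log_energy - 3.0 * beta / 8.0

points = quantile_points(1000)
rate_at_semicircle = discrete_rate(points, beta=2.0)
```

For the semicircular law the value `rate_at_semicircle` is close to zero, with a discretization error that vanishes as $N$ grows; this is the discrete counterpart of the fact that the rate function vanishes at its minimizer.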
But, when $\lambda_1 < \lambda_2 < \cdots < \lambda_N$ and since we have constructed the $x^{i,N}$'s such that $x^{1,N} < x^{2,N} < \cdots < x^{N,N}$, we have, for any integer numbers $(i,j)$, the lower bound

$$|x^{i,N} - x^{j,N} + \lambda_i - \lambda_j| \ge \max\{|x^{j,N} - x^{i,N}|, |\lambda_j - \lambda_i|\}.$$



As a consequence, gives 



i+Kj 

N-l JV 

x J] |^-^|lexp{-^(|^|+5) 2 } 

i=l i=l 
. JV-1 iV 

x / TT |A i+ i-A^TT rfA ' ( 3 - L5 ) 

Moreover, one can easily bound from below the last term in the right hand side 
of jXT~5|) and find 

N-l fj ^ 1 ^(JV-1) , g x (/3/2+l)(JV-l) + l 



Hence, l|3.1.5[) implies : 

N-l 

Q N p (A w £%{)) > I] l^-^'T II k^-ar^l^e-* ^M' N f 

i+Kj i=l 



X 



f 



(JV-1) / , x (/3/2+l)(AT-l) + l 



13/2 + 1 J \2N J 



N 



exp{-N5j2 \% hN \ - N 2 S 2 }. (3.1.7) 



Moreover 

N 



^T( x l ' N ) 2 - V log 

2(N + 1)^ { ' 2(N+l) 2 ^ m 

' 1=1 i^j 



\x i > N -x j > N \ 



< \ I x 2 dv{x) - (3 I log(y - x)dv{x)dv{y) (3.1.8) 



2 
Indeed, since $x \to \log(x)$ increases on $\mathbb{R}^+$, we notice that, with $I_i = [x^{i,N}, x^{i+1,N}]$,
logfe - x)dv{x)dv{y) (3.1.9) 



c 1 > N <x<i/<x Jv . JV 

£ log(ar»' +1 -" - x^K 2 G /i x I j]X < y) 

i<«<i<JV-i 

The same argument holds for $\int x^2\, d\nu(x)$ and $\sum_{i=1}^N (x^{i,N})^2$. We can conclude that

1 , -*N , jf 



lhmnf — logQ^ (A^GB^f)) 
> -<5 / Ixldi/fa:) - 5 2 + liminf{/3 / 

J w^oo y^,. 



log(y - x)dv(x)dv(y) 

<x<y<x N - N 



-\ Jx 2 dv(x)}. (3.1.10) 

= -5 J \x\dv{x) - 5 2 - ^ J J f{x,y)dv{x)dv(y). 

Letting $\delta$ go to zero gives the result.

Note here that we have made heavy use of monotonicity arguments to show that our approximation scheme converges toward the right limit. Such arguments cannot be used in the setup considered by G. Ben Arous and O. Zeitouni |llj when the eigenvalues are complex; this is why they need to perform first a regularization by convolution.



3.2. Discussion and open problems 

There are many ways in which one would like to generalize the previous large deviations estimates; the first would be to remove the assumption that the entries are Gaussian. It happens that such a generalization is still an open problem. In fact, when the entries of the matrix are not Gaussian anymore, the law of the eigenvalues is not independent of that of the eigenvectors and becomes rather complicated. With O. Zeitouni [M], I proved however that concentration of measure results hold. In fact, set

$$X_A(\omega) = ((X_A(\omega))_{ij})_{1\le i,j\le N}, \qquad X_A(\omega) = X_A^*(\omega), \qquad (X_A(\omega))_{ij} = \frac{1}{\sqrt{N}} A_{ij}\,\omega_{ij}$$

with

$$\omega := (\omega^R + i\omega^I) = (\omega_{ij})_{1\le i,j\le N} = (\omega^R_{ij} + \sqrt{-1}\,\omega^I_{ij})_{1\le i,j\le N}, \qquad \omega_{ij} = \bar\omega_{ji},$$

$\{\omega_{ij}, 1 \le i \le j \le N\}$ independent complex random variables with laws $\{P_{ij}, 1 \le i \le j \le N\}$, $P_{ij}$ being a probability measure on $\mathbb{C}$ with

$$P_{ij}(\omega_{ij} \in \cdot) = \int 1_{u + iv \in \cdot}\, P^R_{ij}(du)\, P^I_{ij}(dv),$$

and $A$ is a self-adjoint non-random complex matrix with entries $\{A_{ij}, 1 \le i \le j \le N\}$ uniformly bounded by, say, $a$. Our main result is

Theorem 3.2. a) Assume that the $(P_{ij}, i \le j \in \mathbb{N})$ are uniformly compactly supported, that is that there exists a compact set $K \subset \mathbb{C}$ so that for any $1 \le i \le j \le N$, $P_{ij}(K^c) = 0$. Assume $f$ is convex and Lipschitz. Then, for any $\delta > \delta_0(N) := 8|K|\sqrt{\pi}\,a\,|f|_{\mathcal{L}}/N > 0$,

P N (\tr{f(X A (uj))) - E N [tr(f(X A ))}\ > 8) < 4e~ * 2 («-«oW) 2 ^ 

b) If the $(P_{ij}, 1 \le i \le j \le N)$ satisfy the logarithmic Sobolev inequality with uniform constant $c$, then for any Lipschitz function $f$, for any $\delta > 0$,

¥ N (\tr(f(X A (uj))) - E N [tr(f(X A ))}\ > 8) < 2e . 

This result is a direct consequence of standard results about concentration of measure due to Talagrand and Herbst and the observation that if $f : \mathbb{R} \to \mathbb{R}$ is a Lipschitz function, then $\omega \to \mathrm{tr}(f(X_A(\omega)))$ is also Lipschitz and its Lipschitz constant can be evaluated, and that if $f$ is convex, $\omega \to \mathrm{tr}(f(X_A(\omega)))$ is also convex.

Note here that the matrix $A$ can be taken equal to $\{A_{ij} = 1, 1 \le i,j \le N\}$ to recover results for Wigner's matrices. However, the generalization comes at no cost here and allows one to include at the same time more general types of matrices such as band matrices or Wishart matrices. See a discussion in |fi4| .



Even though this estimate is on the right large deviation scale, it does not specify the rate of deviation toward a given spectral distribution. This problem seems to be very difficult in general. The deviations of the empirical moments of matrices with possibly non-centered entries of order $N^{-1}$ have been studied; in this case, deviations are typically produced by the shift of all the entries and the scaling allows one to see the random matrix as a continuous operator. This should not be the case for Wigner matrices.



Another possible generalization is to consider another common model of Gaussian large random matrices, namely Wishart matrices. Sample covariance matrices (or Wishart matrices) are matrices of the form

$$Y_{N,M} = X_{N,M}\, T_M\, X_{N,M}^*.$$

Here, $X_{N,M}$ is an $N \times M$ matrix with centered real or complex i.i.d. entries of covariance $N^{-1}$ and $T_M$ is an $M \times M$ Hermitian (or symmetric) matrix.
These matrices are often considered in the limit where $M/N$ goes to a constant $\alpha > 0$. Let us assume that $M \le N$, and hence $\alpha \in [0,1]$, to fix the notations. Then, $Y_{N,M}$ has $N - M$ null eigenvalues. Let $(\lambda_1, \cdots, \lambda_M)$ be the $M$ non trivial remaining eigenvalues and denote $\hat\mu^M = M^{-1}\sum_{i=1}^M \delta_{\lambda_i}$. In the case where $T_M = I$ and the entries of $X_{N,M}$ are Gaussian, Hiai and Petz [7j] proved that the law of $\hat\mu^M$ satisfies a large deviation principle. In this case, the joint law of the eigenvalues is given by

N 

Z T M i<j i=l 

so that the analysis performed in the previous section can be generalized to this 
model. In the case where Tm is a positive definite matrix, we have the formula 

dv M {Xi,--- 

with $D(\lambda)$ the $M \times M$ diagonal matrix with entries $(\lambda_1, \cdots, \lambda_M)$ and $I_M^{(\beta)}$ the spherical integral

$$I_N^{(\beta)}(D_N, E_N) := \int \exp\Big\{N\frac{\beta}{2}\mathrm{Tr}(U D_N U^* E_N)\Big\}\, dm^\beta_N(U)$$

with $m^\beta_N$ the Haar measure on the group of orthogonal matrices $O(N)$ if $\beta = 1$ (resp. $U(N)$ if $\beta = 2$, resp. $S(N)$ if $\beta = 4$). $Z^\beta_N(T_M)$ is the normalizing constant such that the above law has mass one. This formula can be found in [73 (58) and (95)]. Hence, we see that the study of the deviations of the spectral measure $\hat\mu^M = \frac{1}{M}\sum_{i=1}^M \delta_{\lambda_i}$ when the spectral measure of $T_M$ converges is equivalent to the study of the asymptotics of the spherical integral $I_N^{(\beta)}(D_N, E_N)$ when the spectral measures of $D_N$ and $E_N$ converge.

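The spherical integral also appears when one considers Gaussian Wigner matrices with non centered entries. Indeed, if we let

$$Y^{N,\beta} = M_N + X^{N,\beta}$$

with $M_N$ a self adjoint deterministic matrix (which can be taken diagonal without loss of generality) and $X^{N,\beta}$ a Gaussian Wigner matrix as considered in the previous section, then the law of the eigenvalues of $Y^{N,\beta}$ is given by

$$dQ^N(\lambda_1, \cdots, \lambda_N) = \frac{1}{Z^{N,\beta}}\prod_{i<j}|\lambda_i - \lambda_j|^\beta\, e^{-\frac{\beta N}{4}\mathrm{Tr}((M_N)^2) - \frac{\beta N}{4}\sum_{i=1}^N \lambda_i^2}\, I_N^{(\beta)}(D(\lambda), M_N)\prod_{i=1}^N d\lambda_i. \qquad (3.2.11)$$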
We shall in the next section study the asymptotics of the spherical integral by studying the deviations of the spectral measure of $Y^{N,\beta}$. This in turn provides the large deviation principle for the law of the spectral measure $\hat\mu^M$ of Wishart matrices by Laplace's method (see HI], Theorem 1.2).

Let us finish this section by mentioning two other natural generalizations in the context of large random matrices. The first consists in allowing the covariances of the entries of the matrix to depend on their position. More precisely, one considers the matrix $Z^{N,\beta} = (z^{N,\beta}_{ij})_{1\le i,j\le N}$ with $(z^{N,\beta}_{ij} = \bar z^{N,\beta}_{ji}, 1 \le i \le j \le N)$ centered independent Gaussian variables with covariance $N^{-1}c^{N,\beta}_{ij}$. $c^{N,\beta}_{ij}$ is in general chosen to be $c^{N,\beta}_{ij} = \phi(\frac{i}{N}, \frac{j}{N})$. When $\phi(x,y) = 1_{|x-y|\le c}$, the matrix is called a band matrix, but the case where $\phi$ is smooth can as well be studied as a natural generalization. The large deviation properties of the spectral measure of such a matrix were studied in [HO] but only a large deviation upper bound could be obtained so far. Indeed, the non-commutativity of the matrices here plays a much more dramatic role, as we shall discuss later. In fact, it can be seen by the so-called linear trick (see jH^J for instance) that studying the deviations of the words of several matrices can be related with the deviations of the spectral measure of matrices with complicated covariances and hence this last problem is intimately related with the study of the so-called microstates entropy discussed in a later chapter.

The second generalization is to consider a matrix where all the entries are independent, and therefore with a priori complex spectrum. Such a generalization was considered by G. Ben Arous and O. Zeitouni JT] in the case of the so-called Ginibre ensemble where the entries are identically distributed standard Gaussian variables, and a large deviation principle for the law of the spectral measure was derived. Let us further note that the method developed in the last section can as well be used if one considers random partitions following the Plancherel measure. In the unitary case, this law can be interpreted as a discretization of the GUE (see S. Kerov [H5J or K. Johansson |74| ) because the dimension of a representation is given in terms of a Coulomb gas potential. Using this similarity, one can prove a large deviation principle for the empirical distribution of these random partitions (see |H21); the rate function is quite similar to that of the GUE except that, because of the discrete nature of the partitions, deviations can only occur toward probability measures which are absolutely continuous with respect to Lebesgue measure and with density bounded by one. More general large deviations techniques have been developed to study random partitions, for instance in |4l)j for the uniform distribution.

Let us finally mention that large deviations can as well be obtained for the law of the largest eigenvalue of the Gaussian ensembles (cf. e.g. [Sj, Theorem 6.2).



Chapter 4 

Asymptotics of spherical 
integrals 



In this chapter, we shall consider the spherical integral

$$I_N^{(\beta)}(D_N, E_N) := \int \exp\Big\{N\frac{\beta}{2}\mathrm{Tr}(U D_N U^* E_N)\Big\}\, dm^\beta_N(U)$$

where $m^\beta_N$ denotes the Haar measure on the orthogonal (resp. unitary, resp. symplectic) group when $\beta = 1$ (resp. $\beta = 2$, resp. $\beta = 4$). This object is actually central in many matters; as we have seen, it describes the law of Gaussian Wishart matrices and non centered Gaussian Wigner matrices. It also appears in many matrix models described in physics; we shall describe this point in the next chapter. It is related with the characters of the symmetric group and Schur functions (cf. [107]) because of the determinantal formula below. We shall discuss this point in the next paragraph.

A formula for $I_N^{(2)}$ was obtained by Itzykson and Zuber (and more generally by Harish-Chandra), see [53] Appendix 5]; whenever the eigenvalues of $D_N$ and $E_N$ are distinct then

$$I_N^{(2)}(D_N, E_N) = \frac{\prod_{p=1}^{N-1} p!}{N^{N(N-1)/2}}\, \frac{\det\{\exp N D_N(i) E_N(j)\}_{1\le i,j\le N}}{\Delta(D_N)\Delta(E_N)}$$

where $\Delta(D_N) = \prod_{i<j}(D_N(j) - D_N(i))$ and $\Delta(E_N) = \prod_{i<j}(E_N(j) - E_N(i))$ are the Vandermonde determinants associated with $D_N$, $E_N$. Although this formula seems to solve the problem, it is far from doing so, due to the possible cancellations appearing in the determinant, so that it is indeed completely unclear how to estimate the logarithmic asymptotics of such a quantity.
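For $N = 2$ and $\beta = 2$ the determinantal formula can be checked by hand: $|U_{11}|^2$ is uniformly distributed on $[0,1]$ under the Haar measure, and integrating the exponential gives the determinant divided by $2\,\Delta(D_N)\Delta(E_N)$. The sketch below compares this closed form with a Monte Carlo average over Haar unitary matrices, sampled by the standard QR decomposition of a complex Gaussian matrix with phase correction. The sampler, the sample size and the test values of $D_N$, $E_N$ are assumptions chosen for illustration.

```python
import numpy as np

def haar_unitary(n, rng):
    """Sample a Haar distributed unitary matrix via QR of a complex Gaussian matrix."""
    z = (rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))) / np.sqrt(2.0)
    q, r = np.linalg.qr(z)
    phases = np.diag(r) / np.abs(np.diag(r))
    return q * phases                      # rescale column k by the phase of r[k, k]

def spherical_integral_mc(d, e, samples, rng):
    """Monte Carlo estimate of the integral of exp{N Tr(U D U* E)} over the unitary group."""
    n = len(d)
    D, E = np.diag(d), np.diag(e)
    total = 0.0
    for _ in range(samples):
        U = haar_unitary(n, rng)
        total += np.exp(n * np.trace(U @ D @ U.conj().T @ E).real)
    return total / samples

def determinant_formula_n2(d, e):
    """Closed form for N = 2: det{exp(2 d_i e_j)} / (2 (d_1 - d_2)(e_1 - e_2))."""
    matrix = np.exp(2.0 * np.outer(d, e))
    return np.linalg.det(matrix) / (2.0 * (d[0] - d[1]) * (e[0] - e[1]))

d = np.array([0.3, -0.2])
e = np.array([0.5, 0.1])
rng = np.random.default_rng(0)
monte_carlo_value = spherical_integral_mc(d, e, 4000, rng)
closed_form_value = determinant_formula_n2(d, e)
```

The two values agree up to Monte Carlo error. For larger $N$ the determinant involves sign cancellations between exponentially large terms, which is why such a direct evaluation gives little information on the logarithmic asymptotics.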

To evaluate this integral, we noticed with O. Zeitouni [HE] that it was enough to derive a large deviation principle for the law of the spectral measure of non centered Wigner matrices. We shall detail this point in Section 4.1. To prove a large deviation principle for such a law, we improved a strategy initiated






with T. Cabanal Duvillard in [20 ED] which consists in considering the matrix $Y^{N,\beta} = D_N + X^{N,\beta}$ as the value at time one of a matrix-valued process

$$Y^{N,\beta}(t) = D_N + H^{N,\beta}(t)$$

where $H^{N,2}$ (resp. $H^{N,1}$, resp. $H^{N,4}$) is a Hermitian (resp. symmetric, resp. symplectic) Brownian motion, that is a Wigner matrix with Brownian motion entries. More explicitly, $H^{N,\beta}$ is a process with values in the set of $N \times N$ self-adjoint matrices with entries $\{H^{N,\beta}_{i,j}(t), t \ge 0, i \le j\}$ constructed via independent real valued Brownian motions $(B^k_{i,j}, 1 \le i \le j \le N, 1 \le k \le 4)$ by

$$H^{N,\beta}_{k,l} = \begin{cases} \dfrac{1}{\sqrt{\beta N}}\sum_{i=1}^{\beta} B^i_{k,l}\, e^i_\beta, & \text{if } k < l,\\[2mm] \dfrac{\sqrt{2}}{\sqrt{\beta N}}\, B^1_{l,l}, & \text{if } k = l, \end{cases} \qquad (4.0.1)$$

where $(e^k_\beta, 1 \le k \le \beta)$ is the basis of $\mathbb{R}^\beta$ described in the previous chapter. The advantage of considering the whole process $Y^{N,\beta}$ is that we can then use stochastic differential calculus and standard techniques to study deviations of processes by martingale properties, as initiated by Kipnis, Olla and Varadhan 88 . The idea to study properties of Gaussian variables by using their characterization as time marginals of Brownian motions is not new. At the level of deviations, it is very natural since we shall construct infinitesimally the paths to follow to create a given deviation. Actually, it seems to be the right way to consider $I_N^{(2)}$ when one realizes that it has a determinantal form according to (1.0.1) and so is by nature related with non-intersecting paths. There is still no more direct study of these asymptotics of the spherical integrals; even though B. Collins [HI] tried to do it by expanding the exponential into moments and using cumulants calculus, obtaining such asymptotics would still require being able to control the convergence of infinite signed series. In the physics literature, A. Matytsin j5J derived the same asymptotics for the spherical integrals as those we shall describe. His methods are quite different and only apply a priori in the unitary case. I think they might be written rigorously if one could a priori prove sufficiently strong convergence of the spherical integral as $N$ goes to infinity. As a matter of fact, I do not think there is any other formula for the limiting spherical integral in the physics literature, but mostly saddle point studies of this a priori converging quantity. I do however mention recent works of B. Eynard et al. [SU| and M. Bertola ^3 who produced a formula of the free energy for the model of matrices coupled in chain by means of residues technology. However, this corresponds to the case where the matrices $E_N$, $D_N$ of the spherical integral have a random spectrum submitted to a smooth polynomial potential and it is not clear how to apply such technology to the hard constraint case where the spectral measures of $D_N$, $E_N$ converge to a prescribed limit.
4.1. Asymptotics of spherical integrals and deviations of the
spectral measure of non centered Gaussian Wigner matrices

Let $Y^{N,\beta} = D_N + X^{N,\beta}$ with a deterministic diagonal matrix $D_N$ and $X^{N,\beta}$ a Gaussian Wigner matrix. We now show how the deviations of the spectral measure of $Y^{N,\beta}$ are related to the asymptotics of the spherical integrals. To this end, we shall make the following hypothesis

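Hypothesis 4.1. 1. There exists $d_{max} \in \mathbb{R}^+$ such that for any integer number $N$, $\hat\mu^N_{D_N}(\{|x| \ge d_{max}\}) = 0$ and $\hat\mu^N_{D_N}$ converges weakly toward $\mu_D \in \mathcal{P}(\mathbb{R})$.

2. $\hat\mu^N_{E_N}$ converges toward $\mu_E \in \mathcal{P}(\mathbb{R})$ while $\hat\mu^N_{E_N}(x^2)$ stays uniformly bounded.

Theorem 4.2. Under hypothesis 4.1,

1) There exists a function $g : [0,1] \times \mathbb{R}^+ \to \mathbb{R}^+$, depending on $\mu_E$ only, such that $g(\delta, L) \to_{\delta\to 0} 0$ for any $L \in \mathbb{R}^+$, and, for $E_N$, $\bar E_N$ such that

$$d(\hat\mu^N_{E_N}, \mu_E) + d(\hat\mu^N_{\bar E_N}, \mu_E) \le \delta/2, \qquad (4.1.2)$$

and

$$\int x^2\, d\hat\mu^N_{E_N}(x) + \int x^2\, d\hat\mu^N_{\bar E_N}(x) \le L,$$

it holds that

$$\limsup_{N\to\infty}\Big|\frac{1}{N^2}\log\frac{I_N^{(\beta)}(D_N, E_N)}{I_N^{(\beta)}(D_N, \bar E_N)}\Big| \le g(\delta, L). \qquad (4.1.3)$$

We define

$$\bar I^{(\beta)}(\mu_D, \mu_E) = \limsup_{N\uparrow\infty}\frac{1}{N^2}\log I_N^{(\beta)}(D_N, E_N),$$

$$\underline{I}^{(\beta)}(\mu_D, \mu_E) = \liminf_{N\uparrow\infty}\frac{1}{N^2}\log I_N^{(\beta)}(D_N, E_N).$$

By the preceding, $\bar I^{(\beta)}(\mu_D, \mu_E)$ and $\underline{I}^{(\beta)}(\mu_D, \mu_E)$ are continuous functions on $\{(\mu_E, \mu_D) \in \mathcal{P}(\mathbb{R})^2 : \int x^2\, d\mu_E(x) + \int x^2\, d\mu_D(x) \le L\}$ for any $L < \infty$.

2) For any probability measure $\mu \in \mathcal{P}(\mathbb{R})$,

$$\inf_{\delta\to 0}\liminf_{N\to\infty}\frac{1}{N^2}\log P\big(d(\hat\mu^N_Y, \mu) < \delta\big) = \inf_{\delta\to 0}\limsup_{N\to\infty}\frac{1}{N^2}\log P\big(d(\hat\mu^N_Y, \mu) < \delta\big) := -J_\beta(\mu_D, \mu).$$

3) We let, for any $\mu \in \mathcal{P}(\mathbb{R})$,

$$I_\beta(\mu) = \frac{\beta}{4}\int x^2\, d\mu(x) - \frac{\beta}{2}\int\int \log|x - y|\, d\mu(x)\, d\mu(y).$$

If $\hat\mu^N_{E_N}$ converges toward $\mu_E \in \mathcal{P}(\mathbb{R})$ with $I_\beta(\mu_E) < \infty$, we have

$$I^{(\beta)}(\mu_D, \mu_E) := \bar I^{(\beta)}(\mu_D, \mu_E) = \underline{I}^{(\beta)}(\mu_D, \mu_E) = -J_\beta(\mu_D, \mu_E) + I_\beta(\mu_E) - \inf_{\mu\in\mathcal{P}(\mathbb{R})} I_\beta(\mu) + \frac{\beta}{4}\int x^2\, d\mu_D(x).$$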



Before going any further, let us point out that these results give interesting asymptotics for Schur functions which are defined as follows.

• a Young shape $\lambda$ is a finite sequence of non-negative integers $(\lambda_1, \lambda_2, \ldots, \lambda_l)$ written in non-increasing order. One should think of it as a diagram whose $i$th line is made of $\lambda_i$ empty boxes: for example,

□□□□
□□□□
□□□
□□

corresponds to $\lambda_1 = 4$, $\lambda_2 = 4$, $\lambda_3 = 3$, $\lambda_4 = 2$.

We denote by $|\lambda| = \sum_i \lambda_i$ the total number of boxes of the shape $\lambda$. In the sequel, when we have a shape $\lambda = (\lambda_1, \lambda_2, \ldots)$ and an integer $N$ greater than the number of lines of $\lambda$ having a strictly positive length, we will define a sequence $l$ associated to $\lambda$ and $N$, which is an $N$-tuple of integers $l_i = \lambda_i + N - i$. In particular we have that $l_1 > l_2 > \cdots > l_N \ge 0$ and $l_i - l_{i+1} \ge 1$.

• for some fixed $N \in \mathbb{N}$, a Young tableau will be any filling of the Young shape above with integers from 1 to $N$ which is non-decreasing on each line and (strictly) increasing on each column. For each such filling, we define the content of a Young tableau as the $N$-tuple $(\mu_1, \ldots, \mu_N)$ where $\mu_i$ is the number of $i$'s written in the tableau.

[Figure: an example of an allowed filling, with content (2, 2, 2), and an example of a filling that is not allowed.]

Notice that, for $N \in \mathbb{N}$, a Young shape can be filled with integers from 1 to $N$ if and only if $\lambda_i = 0$ for $i > N$.

• for a Young shape $\lambda$ and an integer $N$, the Schur polynomial $s_\lambda$ is an element of $\mathbb{C}[x_1, \ldots, x_N]$ defined by

$$s_\lambda(x_1, \ldots, x_N) = \sum_T x_1^{\mu_1} \cdots x_N^{\mu_N} \qquad (4.1.4)$$

where the sum is taken over all Young tableaux $T$ of fixed shape $\lambda$ and $(\mu_1, \ldots, \mu_N)$ is the content of $T$. From a statistical point of view, one can
think of the filling as the heights of a surface sitting on the tableau $\lambda$, $\mu_i$ being the surface of the height $i$. $s_\lambda$ is then a generating function for these heights when one considers the surfaces uniformly distributed under the constraints prescribed for the filling. Note that $s_\lambda$ is positive whenever the $x_i$'s are and, although it is not obvious from this definition (cf. for example [107] for a proof), $s_\lambda$ is a symmetric function of the $x_i$'s and actually $(s_\lambda, \lambda)$ form a basis of symmetric functions and hence play a key role in representation theory of the symmetric group. If $A$ is a matrix in $M_N(\mathbb{C})$, then define $s_\lambda(A) = s_\lambda(A_1, \ldots, A_N)$, where the $A_i$'s are the eigenvalues of $A$. Then, by Weyl formula (cf. Theorem 7.5.B of [130]), for any matrices $V$, $W$,

$$\int s_\lambda(U V U^* W)\, dm_N(U) = \frac{1}{d_\lambda}\, s_\lambda(V)\, s_\lambda(W), \qquad (4.1.5)$$

where $d_\lambda = s_\lambda(I)$.
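The tableau definition (4.1.4) and the determinantal formula for Schur polynomials can be compared on small examples. The sketch below enumerates the fillings of a Young shape by brute force and evaluates the classical quotient of determinants $\det(x_i^{l_j})/\det(x_i^{N-j})$ with $l_j = \lambda_j + N - j$, in exact arithmetic. The function names are hypothetical and the brute-force enumeration is only meant for very small shapes.

```python
from fractions import Fraction
from itertools import product

def young_tableaux(shape, n):
    """Yield all fillings with entries in 1..n, non-decreasing on lines, increasing on columns."""
    cells = [(r, c) for r, length in enumerate(shape) for c in range(length)]
    for values in product(range(1, n + 1), repeat=len(cells)):
        t = dict(zip(cells, values))
        lines_ok = all(t[(r, c)] <= t[(r, c + 1)] for (r, c) in cells if (r, c + 1) in t)
        columns_ok = all(t[(r, c)] < t[(r + 1, c)] for (r, c) in cells if (r + 1, c) in t)
        if lines_ok and columns_ok:
            yield t

def schur_by_tableaux(shape, xs):
    """Sum over tableaux of x_1^{mu_1} ... x_N^{mu_N}, as in the tableau definition."""
    total = 0
    for t in young_tableaux(shape, len(xs)):
        term = 1
        for value in t.values():
            term *= xs[value - 1]
        total += term
    return total

def det(m):
    """Determinant by Laplace expansion along the first row (fine for tiny matrices)."""
    if len(m) == 1:
        return m[0][0]
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))

def schur_by_determinant(shape, xs):
    """Quotient det(x_i^{l_j}) / det(x_i^{N-j}) with l_j = lambda_j + N - j (distinct x_i)."""
    n = len(xs)
    lam = list(shape) + [0] * (n - len(shape))
    numerator = det([[Fraction(x) ** (lam[j] + n - 1 - j) for j in range(n)] for x in xs])
    vandermonde = det([[Fraction(x) ** (n - 1 - j) for j in range(n)] for x in xs])
    return numerator / vandermonde

number_of_tableaux = schur_by_tableaux((2, 1), (1, 1, 1))   # counts the fillings for N = 3
value_by_tableaux = schur_by_tableaux((2, 1), (2, 3, 5))
value_by_determinant = schur_by_determinant((2, 1), (2, 3, 5))
```

For the shape $(2,1)$ and $N = 3$ both computations give the same value, which also equals $(x_1+x_2)(x_1+x_3)(x_2+x_3)$ at the chosen point.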

Then, because s\ also has a determinantal formula, we can see (cf. |1()7| and 

E2) 

•><-)-fK) 4 (W ( "- 6) 

where denotes the diagonal matrix with entries iV _1 (Ai — i + N) and A the 
Vandermonde determinant. Therefore, we have the following immediate corollary to Theorem 4.2:

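Corollary 4.3. Let $\lambda^N$ be a sequence of Young shapes and set $D_N = (N^{-1}(\lambda^N_i - i + N))_{1\le i\le N}$. We pick a sequence of Hermitian matrices $E_N$ and assume that $(D_N, E_N)_{N\in\mathbb{N}}$ satisfy hypothesis 4.1 and that $\Sigma(\mu_D) > -\infty$. Then,

$$\lim_{N\to\infty}\frac{1}{N^2}\log s_{\lambda^N}(e^{E_N}) = I^{(2)}(\mu_E, \mu_D) - \frac{1}{2}\int\int \log\Big[\int_0^1 e^{\alpha x + (1-\alpha)y}\, d\alpha\Big]\, d\mu_E(x)\, d\mu_E(y) + \frac{1}{2}\Sigma(\mu_D).$$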

Proof of Theorem 4.2: To simplify, let us assume that $E_N$ and $\bar E_N$ are uniformly bounded by a constant $M$. Let $\delta' > 0$ and $\{A_j\}_{j\in J}$ be a partition of $[-M, M]$ such that $|A_j| \in [\delta', 2\delta']$ and the endpoints of $A_j$ are continuity points of $\mu_E$. Denote

$$I_j = \{i : E_N(ii) \in A_j\}, \qquad \bar I_j = \{i : \bar E_N(ii) \in A_j\}.$$

Then,

$$|\mu_E(A_j) - |I_j|/N| + |\mu_E(A_j) - |\bar I_j|/N| \le \delta.$$

We construct a permutation $\sigma_N$ so that $|E(ii) - \bar E(\sigma_N(i), \sigma_N(i))| < 2\delta$ except possibly for very few $i$'s as follows. First, if $|I_j| \le |\bar I_j|$ then $\tilde I_j := I_j$, whereas if $|I_j| > |\bar I_j|$ then $|\tilde I_j| = |\bar I_j|$ while $\tilde I_j \subset I_j$. Then, choose and fix a permutation $\sigma_N$ such that $\sigma_N(\tilde I_j) \subset \bar I_j$. Then, one can check that if $J_0 = \{i : |E(ii) - \bar E(\sigma_N(i), \sigma_N(i))| < 2\delta\}$,

\Jo\ > \lJ j a N (I j )\=Y^WN(T j )\>N-J2\Wj\ 

3 3 

M 

> TV-maxd/,-! - I,- J\>N-2SN— 

3 

Next, note the invariance of I^\Dn, En) to permutations of the matrix ele- 
ments of -Djv- That is, 

lf{D Nl E N ) = J exp{N^Tr(UD N U*E N )}dm N (U) 

eMN^J2 u ^ L> N(kk)E N {ii)}dm l3 N (U) 



exp{Nj2 u ik D N{kk)E N {a N (i)(r N (i))}drn N (U). 

i,k 

But, with d max = maxfc \Djy(kk)\ bounded uniformly in TV, 
N- l Y J ^ikDN{kk)E N {a N {i)a N {i)) 

i,k 

= N- 1 J2Y, u ^ D N(kk)E N (a N (i)a N (i)) 

ieJo k 

(a N (i)a N (i)) 

ft J a k 

< N- 1 u 2 lk D N (kk){E N (ii) + 26) + N^d^MU^l 

i,k 

M 2 S 



< N- 1 u 2 k D N (kk)E N (ii) + d max 



Hence, we obtain, taking d ma x^ — VS, 

I { f(D N ,E N ) < e N ^I^(D N ,E N ) 

and the reverse inequality by symmetry. This proves the first point of the the- 
orem when (En, En) are uniformly bounded. The general case (which is not 
much more complicated) is proved in [65j and follows from first approximating 
En and En by bounded operators using 

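The second and the third points are proved simultaneously: in fact, writing

$$P\big(d(\hat\mu^N_Y, \mu) < \delta\big) = \frac{1}{Z^\beta_N}\int_{d(\hat\mu^N_Y, \mu) < \delta} e^{-\frac{\beta N}{4}\mathrm{Tr}((Y^{N,\beta} - D_N)^2)}\, dY^{N,\beta}$$

$$= \frac{1}{Z^\beta_N}\int_{d(N^{-1}\sum_{i=1}^N \delta_{\lambda_i}, \mu) < \delta} I_N^{(\beta)}(D(\lambda), D_N)\, e^{-\frac{\beta N}{4}\mathrm{Tr}(D_N^2) - \frac{\beta N}{4}\sum_{i=1}^N \lambda_i^2}\, |\Delta(\lambda)|^\beta \prod_{i=1}^N d\lambda_i$$

with the normalizing constant

$$Z^\beta_N = \int e^{-\frac{\beta N}{4}\mathrm{Tr}((Y^{N,\beta} - D_N)^2)}\, dY^{N,\beta} = \int e^{-\frac{\beta N}{4}\mathrm{Tr}((Y^{N,\beta})^2)}\, dY^{N,\beta},$$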

we see that the first point gives, since Ij^\D(X), Dn) is approximately constant 
on MTV- 1 £ , M ) < (5} n {d(£g„ , M < 6}, 

e *V<'W)-fAS„(* 2 )) . N 

w f .Lv, ^^^^n^ 

= e-^^ +N2lW) ^P(d(&,») < S) 

where $A_{N,\delta} \approx B_{N,\delta}$ means that $N^{-2}\log A_{N,\delta} B_{N,\delta}^{-1}$ goes to zero as $N$ goes to infinity first and then $\delta$ goes to zero.

An equivalent way to obtain this relation is to use (1.0.1) together with the continuity of spherical integrals in order to replace the fixed values at time one of the Brownian motion $B_1 = Y$ by an average over a small ball $\{B_1 : d(\hat\mu^N_{B_1}, \mu) \le \delta\}$.

The large deviation principle proved in the third chapter of these notes shows 2) and 3).

□ 

Note for 3) that if $I_\beta(\mu_E) = +\infty$, $J_\beta(\mu_D, \mu_E) = +\infty$ so that in this case the result is empty since it leads to an indeterminate form. Still, if $I_\beta(\mu_D) < \infty$, by symmetry of $I^{(\beta)}$ we obtain a formula by exchanging $\mu_D$ and $\mu_E$. If both $I_\beta(\mu_D)$ and $I_\beta(\mu_E)$ are infinite, we can only argue, by continuity of $I^{(\beta)}$, that for any sequence $(\mu^\epsilon_E)_{\epsilon>0}$ of probability measures with uniformly bounded variance and finite entropy $I_\beta$ converging toward $\mu_E$,

$$I^{(\beta)}(\mu_D, \mu_E) = \lim_{\epsilon\to 0}\{-J_\beta(\mu_D, \mu^\epsilon_E) + I_\beta(\mu^\epsilon_E)\} - \inf I_\beta + \frac{\beta}{4}\int x^2\, d\mu_D(x).$$

A more explicit formula is not yet available.

Note here that the convergence of the spherical integral is in fact not obvious 
and here given by the fact that we have convergence of the probability of devi- 
ation toward a given probability measure for the law of the spectral measure of 
non-centered Wigner matrices. 
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4.2. Large deviation principle for the law of the spectral measure of 
non-centered Wigner matrices 

The goal of this section is to prove the following theorem. 

Theorem 4.4. Assume that $D_N$ is uniformly bounded with spectral measure
converging toward $\mu_D$. Then the law of the spectral measure $\hat\mu^N_Y$ of the Wigner
matrix $Y^{N,\beta}=D_N+X^{N,\beta}$ satisfies a large deviation principle with good rate
function $J_\beta(\mu_D,\cdot)$ in the scale $N^2$.

By Bryc's theorem (2.0.2), it is clear that the above large deviation principle
statement is equivalent to the fact that for any bounded continuous function $f$ on $\mathcal P(\mathbb R)$,

$$\Lambda(f)=\lim_{N\to\infty}\frac{1}{N^2}\log\int e^{N^2 f(\hat\mu^N_Y)}\,dP$$

exists and is given by $-\inf_\nu\{J_\beta(\mu_D,\nu)-f(\nu)\}$. It is not clear how one could
a priori study such limits, except for very trivial functions $f$. However, if we
consider the matrix-valued process $Y^{N,\beta}(t)=D_N+H^{N,\beta}(t)$ with Brownian
motion $H^{N,\beta}$ described in (4.0.1) and its spectral measure process

$$\hat\mu^N_t=\frac1N\sum_{i=1}^N\delta_{\lambda_i(Y^{N,\beta}(t))}\in\mathcal P(\mathbb R),\qquad t\in[0,1],$$

we may construct martingales by use of Itô's calculus. Continuous martingales
lead to exponential martingales, which have constant expectation, and therefore
allow one to compute the exponential moments of a whole family of functionals
of $\hat\mu^N$. This idea will easily give a large deviation upper bound for the law of
$(\hat\mu^N_t, t\in[0,1])$, and therefore for the law of $\hat\mu^N_1$, which is the law of $\hat\mu^N_Y$. The
difficult point here is to check that it is enough to compute the exponential
moments of this family of functionals in order to obtain the large deviation
lower bound.

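Let us now state our result more precisely. We shall consider $\{\hat\mu^N(t), t\in[0,1]\}$
as an element of the set $\mathcal C([0,1],\mathcal P(\mathbb R))$ of continuous processes with values
in $\mathcal P(\mathbb R)$. The rate function for these deviations shall be given as follows. For
any $f,g\in C_b^{2,1}(\mathbb R\times[0,1])$, any $s\le t\in[0,1]$, and any $\nu_.\in\mathcal C([0,1],\mathcal P(\mathbb R))$, we let

$$S^{s,t}(\nu,f)=\int f(x,t)\,d\nu_t(x)-\int f(x,s)\,d\nu_s(x)-\int_s^t\!\!\int\partial_u f(x,u)\,d\nu_u(x)\,du-\frac12\int_s^t\!\!\iint\frac{\partial_x f(x,u)-\partial_x f(y,u)}{x-y}\,d\nu_u(x)\,d\nu_u(y)\,du,\qquad(4.2.7)$$

$$\langle f,g\rangle^{s,t}_\nu=\int_s^t\!\!\int\partial_x f(x,u)\,\partial_x g(x,u)\,d\nu_u(x)\,du,\qquad(4.2.8)$$

and

$$\bar S^{s,t}(\nu,f)=S^{s,t}(\nu,f)-\frac12\langle f,f\rangle^{s,t}_\nu.\qquad(4.2.9)$$

Set, for any probability measure $\mu\in\mathcal P(\mathbb R)$,

$$S_\mu(\nu):=\begin{cases}+\infty,&\text{if }\nu_0\neq\mu,\\[2pt] S^{0,1}(\nu):=\sup_{f\in C_b^{2,1}(\mathbb R\times[0,1])}\ \sup_{0\le s\le t\le1}\bar S^{s,t}(\nu,f),&\text{otherwise.}\end{cases}$$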

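Then, the main theorem of this section is the following.

Theorem 4.5. 1) For any $\mu\in\mathcal P(\mathbb R)$, $S_\mu$ is a good rate function on $\mathcal C([0,1],\mathcal P(\mathbb R))$,
i.e. $\{\nu\in\mathcal C([0,1],\mathcal P(\mathbb R)) : S_\mu(\nu)\le M\}$ is compact for any $M\in\mathbb R^+$.

2) Assume that

$$\text{there exists }\epsilon>0\text{ such that }\sup_N\hat\mu^N_{D_N}(|x|^{5+\epsilon})<\infty,\qquad\hat\mu^N_{D_N}\text{ converges toward }\mu_D,\qquad(4.2.10)$$

then the law of $(\hat\mu^N_t, t\in[0,1])$ satisfies a large deviation principle in the scale
$N^2$ with good rate function $S_{\mu_D}$.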

Remark 4.6: In [65], the large deviation principle was only obtained for marginals;
it was proved at the level of processes in [66].

Note that the map $(\mu_t, t\in[0,1])\mapsto\mu_1$ is continuous from $\mathcal C([0,1],\mathcal P(\mathbb R))$
into $\mathcal P(\mathbb R)$, so that Theorem 4.5 and the contraction principle Theorem 2.5 imply
that

Theorem 4.7. Under assumption (4.2.10), Theorem 4.4 is true with

$$J_\beta(\mu_D,\mu_E)=\frac{\beta}{2}\inf\{S_{\mu_D}(\nu_.)\ ;\ \nu_1=\mu_E\}.$$

The main point to prove Theorem 4.5 is to observe that the evolution of $\hat\mu^N$ is
described, thanks to Itô's calculus, by an autonomous differential equation. This
is easily seen from the fact observed by Dyson (see also [93], Theorem 8.2.1)
that the eigenvalues $(\lambda^i_t, 1\le i\le N, 0\le t\le1)$ of $(Y^{N,\beta}(t), 0\le t\le1)$ are
described as the strong solution of the interacting particle system

$$d\lambda^i_t=\frac{\sqrt2}{\sqrt{\beta N}}\,dB^i_t+\frac1N\sum_{j\neq i}\frac{1}{\lambda^i_t-\lambda^j_t}\,dt\qquad(4.2.11)$$

with $\mathrm{diag}(\lambda^1_0,\dots,\lambda^N_0)=D_N$ and $\beta=1$, 2 or 4. This is the starting point to use
the ideas of the papers of Kipnis, Olla and Varadhan [55]. These papers concern the case
where the diffusive term is not vanishing ($\beta N$ is of order one). The large
deviations for the law of the empirical measure of the particles following (4.2.11)
in such a scaling have been recently studied by Fontbona [52] in the context of
McKean-Vlasov diffusion with singular interaction. We shall first recall for the
reader these techniques when one considers the empirical measures of independent
Brownian motions as presented in [57]. We will then describe the necessary
changes to adapt this strategy to our setting.
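To see concretely why the evolution of the empirical measure closes on itself, one can check numerically the symmetrization that turns the interaction drift of (4.2.11) into a functional of the empirical measure alone. The sketch below is only an illustration: it assumes that the drift in (4.2.11) is $N^{-1}\sum_{j\neq i}(\lambda^i-\lambda^j)^{-1}$, and the function names and the sample configuration are hypothetical choices, not taken from the text.

```python
def interaction_drift(fprime, lam):
    """(1/N^2) * sum_{i != j} f'(lam_i) / (lam_i - lam_j).

    This is the drift of (1/N) * sum_i f(lam_i) produced by the interaction
    term, under the assumed form of (4.2.11).
    """
    n = len(lam)
    total = 0.0
    for i in range(n):
        for j in range(n):
            if i != j:
                total += fprime(lam[i]) / (lam[i] - lam[j])
    return total / n**2


def symmetrized_drift(fprime, lam):
    """(1/(2 N^2)) * sum_{i != j} (f'(lam_i) - f'(lam_j)) / (lam_i - lam_j).

    Off-diagonal part of the double integral of the difference quotient of f'
    against the empirical measure: it depends on the points only through
    the empirical measure.
    """
    n = len(lam)
    total = 0.0
    for i in range(n):
        for j in range(n):
            if i != j:
                total += (fprime(lam[i]) - fprime(lam[j])) / (lam[i] - lam[j])
    return total / (2 * n**2)


# Hypothetical configuration of distinct "eigenvalues".
points = [-1.3, -0.2, 0.5, 1.1, 2.4]
cubic_prime = lambda x: 3.0 * x * x  # f(x) = x^3
print(interaction_drift(cubic_prime, points), symmetrized_drift(cubic_prime, points))
```

Both quantities coincide for any configuration of distinct points; for $f(x)=x^2$ their common value is $(N-1)/N$ whatever the points are.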
4.2.1. Large deviations from the hydrodynamical limit for a system
of independent Brownian particles

Note that the deviations of the law of the empirical measure of independent
Brownian motions on path space

$$L^N=\frac1N\sum_{i=1}^N\delta_{B^i_{[0,1]}}\in\mathcal P(\mathcal C([0,1],\mathbb R))$$

are well known by Sanov's theorem which yields (cf. Section 6.2)

Theorem 4.8. Let $W$ be the Wiener law. Then, the law $(L^N)_\#W^{\otimes N}$ of $L^N$
under $W^{\otimes N}$ satisfies a large deviation principle in the scale $N$ with rate function
given, for $\mu\in\mathcal P(\mathcal C([0,1],\mathbb R))$, by $I(\mu|W)$ which is infinite if $\mu$ is not absolutely
continuous with respect to $W$ and otherwise given by

$$I(\mu|W)=\int\log\frac{d\mu}{dW}\,d\mu.$$

Thus, if we consider

$$\hat\mu^N_t=\frac1N\sum_{i=1}^N\delta_{B^i_t},\qquad t\in[0,1],$$

since $L^N\mapsto(\hat\mu^N_t, t\in[0,1])$ is continuous from $\mathcal P(\mathcal C([0,1],\mathbb R))$ into $\mathcal C([0,1],\mathcal P(\mathbb R))$,
the contraction principle shows immediately that the law of $(\hat\mu^N_t, t\in[0,1])$
under $W^{\otimes N}$ satisfies a large deviation principle with rate function given, for
$p\in\mathcal C([0,1],\mathcal P(\mathbb R))$, by

$$S(p)=\inf\{I(\mu|W) : (x_t)_\#\mu=p_t\ \ \forall t\in[0,1]\}.$$

Here, $(x_t)_\#\mu$ denotes the law of $x_t$ under $\mu$. It was shown by Föllmer [51] that
in fact $S(p)$ is infinite unless there exists $k\in L^2(p_t(dx)dt)$ such that

$$\inf_f\int_0^1\!\!\int(\partial_x f(x,t)-k(x,t))^2\,p_t(dx)\,dt=0,\qquad(4.2.12)$$

and for all $f\in C_b^{2,1}(\mathbb R\times[0,1])$,

$$\partial_t p_t(f_t)=p_t(\partial_t f_t)+\frac12p_t(\partial_x^2 f_t)+p_t(\partial_x f_t\,k_t).$$

Moreover, we then have

$$S(p)=\frac12\int_0^1p_t(k_t^2)\,dt.\qquad(4.2.13)$$
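As a sanity check of (4.2.12) and (4.2.13), one may work out a simple case that is not taken from the text: assume that $p_t$ is the law of $B_t+vt$ for a constant drift $v\in\mathbb R$ (a hypothetical example chosen for illustration only).

```latex
% Worked example under the assumption p_t = N(vt, t), the law of B_t + v t.
% Ito's formula applied to f(B_t + v t, t), followed by taking expectations, gives
\partial_t p_t(f_t) = p_t(\partial_t f_t) + \tfrac12\, p_t(\partial_x^2 f_t) + p_t(\partial_x f_t \, v),
% so that the field is the constant k_t(x) = v, and (4.2.13) yields
S(p) = \frac12 \int_0^1 p_t(k_t^2)\, dt = \frac{v^2}{2}.
```

This value is also the relative entropy with respect to $W$ of the law of $(B_t+vt)_{0\le t\le1}$, as the Cameron-Martin formula shows.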

Kipnis and Olla |H2] proposed a direct approach to obtain this result based on 
exponential martingales. Its advantage is to be much more robust and to adapt 
to many complicated settings encountered in hydrodynamics (cf. [EH!)- Let us 
now summarize it. It follows the following scheme 
• Exponential tightness and study of the rate function $S$. Since the rate
function $S$ is the contraction of the relative entropy $I(\cdot|W)$, it is clearly a
good rate function. This can be proved directly from formula (4.2.13) as we
shall detail it in the context of the eigenvalues of large random matrices.
Similarly, we shall not detail here the proof that $(\hat\mu^N)_\#W^{\otimes N}$ is exponentially
tight, which reduces the proof of the large deviation principle to the proof
of a weak large deviation principle and thus to estimating the probability of
deviations into small open balls (cf. Chapter 2). We will now concentrate
on this last point.

• Itô's calculus: Itô's calculus (cf. [82], Theorem 3.3 p. 149) implies that for
any function $F$ in $C_b^{2,1}(\mathbb R^N\times[0,1])$, any $t\in[0,1]$,

$$F(B^1_t,\dots,B^N_t,t)=F(0,\dots,0)+\int_0^t\partial_sF(B^1_s,\dots,B^N_s,s)\,ds+\sum_{i=1}^N\int_0^t\partial_{x_i}F(B^1_s,\dots,B^N_s,s)\,dB^i_s+\frac12\sum_{1\le i\le N}\int_0^t\partial_{x_i}^2F(B^1_s,\dots,B^N_s,s)\,ds.$$

Moreover, $M^F_t=\sum_{i=1}^N\int_0^t\partial_{x_i}F(B^1_s,\dots,B^N_s,s)\,dB^i_s$ is a martingale with
respect to the filtration of the Brownian motion, with bracket

$$\langle M^F\rangle_t=\sum_{i=1}^N\int_0^t[\partial_{x_i}F(B^1_s,\dots,B^N_s,s)]^2\,ds.$$

Taking $F(x_1,\dots,x_N,t)=N^{-1}\sum_{i=1}^Nf(x_i,t)=\int f(x,t)\,d\hat\mu^N(x)=\hat\mu^N(f_t)$,
we deduce that for any $f\in C_b^{2,1}(\mathbb R\times[0,1])$,

$$M^N_f(t)=\hat\mu^N_t(f_t)-\hat\mu^N_0(f_0)-\int_0^t\hat\mu^N_s(\partial_sf_s)\,ds-\int_0^t\hat\mu^N_s\Big(\frac12\partial_x^2f_s\Big)\,ds$$

is a martingale with bracket

$$\langle M^N_f\rangle_t=\frac1N\int_0^t\hat\mu^N_s((\partial_xf_s)^2)\,ds.$$

The last ingredient of stochastic calculus we want to use is that (cf. [82],
Problem 2.28, p. 147) for any bounded continuous martingale $m_t$ with
bracket $\langle m\rangle_t$, any $\lambda\in\mathbb R$,

$$\Big(\exp\Big(\lambda m_t-\frac{\lambda^2}{2}\langle m\rangle_t\Big),\ t\in[0,1]\Big)$$

is a martingale. In particular, it has constant expectation. Thus, we deduce
that for all $f\in C_b^{2,1}(\mathbb R\times[0,1])$, all $t\in[0,1]$,

$$E\Big[\exp\Big\{N\Big(M^N_f(t)-\frac N2\langle M^N_f\rangle_t\Big)\Big\}\Big]=1.\qquad(4.2.14)$$
• Weak large deviation upper bound

We equip $\mathcal C([0,1],\mathcal P(\mathbb R))$ with the weak topology on $\mathcal P(\mathbb R)$ and the uniform
topology on the time variable. It is then a Polish space. A distance
compatible with such a topology is for instance given, for any $\mu,\nu\in\mathcal C([0,1],\mathcal P(\mathbb R))$, by

$$D(\mu,\nu)=\sup_{t\in[0,1]}d(\mu_t,\nu_t)$$

with a distance $d$ on $\mathcal P(\mathbb R)$ compatible with the weak topology such as

$$d(\mu_t,\nu_t)=\sup_{\|f\|_{\mathcal L}\le1}\Big|\int f(x)\,d\mu_t(x)-\int f(x)\,d\nu_t(x)\Big|$$

where $\|f\|_{\mathcal L}$ is the bounded Lipschitz norm of $f$:

$$\|f\|_{\mathcal L}=\sup_{x\in\mathbb R}|f(x)|+\sup_{x\neq y}\frac{|f(x)-f(y)|}{|x-y|}.$$

We prove here that

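Lemma 4.9. For any $p\in\mathcal C([0,1],\mathcal P(\mathbb R))$,

$$\limsup_{\delta\to0}\limsup_{N\to\infty}\frac1N\log W^{\otimes N}(D(\hat\mu^N,p)\le\delta)\le-S(p).$$

Proof: Let $p\in\mathcal C([0,1],\mathcal P(\mathbb R))$. Observe first that if $p_0\neq\delta_0$, since $\hat\mu^N_0=\delta_0$
almost surely,

$$\limsup_{\delta\to0}\limsup_{N\to\infty}\frac1N\log W^{\otimes N}\Big(\sup_{t\in[0,1]}d(\hat\mu^N_t,p_t)\le\delta\Big)=-\infty.$$

Therefore, let us assume that $p_0=\delta_0$. We set

$$B(p,\delta)=\{\mu\in\mathcal C([0,1],\mathcal P(\mathbb R)) : D(\mu,p)\le\delta\}.$$

Let us denote, for $f,g\in C_b^{2,1}(\mathbb R\times[0,1])$, $\mu\in\mathcal C([0,1],\mathcal P(\mathbb R))$, $0\le t\le1$,

$$T^{0,t}(f,\mu)=\mu_t(f_t)-\mu_0(f_0)-\int_0^t\mu_s(\partial_sf_s)\,ds-\int_0^t\mu_s\Big(\frac12\partial_x^2f_s\Big)\,ds$$

and

$$\langle f,g\rangle^{0,t}_\mu:=\int_0^t\mu_s(\partial_xf_s\,\partial_xg_s)\,ds.$$

Then, by (4.2.14), for any $t\le1$,

$$E\Big[\exp\Big\{N\Big(T^{0,t}(f,\hat\mu^N)-\frac12\langle f,f\rangle^{0,t}_{\hat\mu^N}\Big)\Big\}\Big]=1.$$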
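Therefore, if we denote in short $T(f,\mu)=T^{0,1}(f,\mu)-\frac12\langle f,f\rangle^{0,1}_\mu$,

$$\begin{aligned}
W^{\otimes N}(D(\hat\mu^N,p)\le\delta)&=W^{\otimes N}\Big(1_{D(\hat\mu^N,p)\le\delta}\,\frac{e^{NT(f,\hat\mu^N)}}{e^{NT(f,\hat\mu^N)}}\Big)\\
&\le\exp\{-N\inf_{B(p,\delta)}T(f,\cdot)\}\,W^{\otimes N}\Big(1_{D(\hat\mu^N,p)\le\delta}\,e^{NT(f,\hat\mu^N)}\Big)\\
&\le\exp\{-N\inf_{B(p,\delta)}T(f,\cdot)\}\,W^{\otimes N}\Big(e^{NT(f,\hat\mu^N)}\Big)\qquad(4.2.15)\\
&=\exp\{-N\inf_{\mu\in B(p,\delta)}T(f,\mu)\}.
\end{aligned}$$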

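Since $\mu\mapsto T(f,\mu)$ is continuous when $f\in C_b^{2,1}(\mathbb R\times[0,1])$, we arrive at

$$\limsup_{\delta\to0}\limsup_{N\to\infty}\frac1N\log W^{\otimes N}\Big(\sup_{t\in[0,1]}d(\hat\mu^N_t,p_t)\le\delta\Big)\le-T(f,p).$$

We now optimize over $f$ to obtain a weak large deviation upper bound
with rate function

$$\begin{aligned}
S(p)&=\sup_{f\in C_b^{2,1}(\mathbb R\times[0,1])}\Big(T^{0,1}(f,p)-\frac12\langle f,f\rangle^{0,1}_p\Big)\\
&=\sup_{f\in C_b^{2,1}(\mathbb R\times[0,1])}\ \sup_{\lambda\in\mathbb R}\Big(\lambda T^{0,1}(f,p)-\frac{\lambda^2}{2}\langle f,f\rangle^{0,1}_p\Big)\\
&=\frac12\sup_{f\in C_b^{2,1}(\mathbb R\times[0,1])}\frac{T^{0,1}(f,p)^2}{\langle f,f\rangle^{0,1}_p}.\qquad(4.2.16)
\end{aligned}$$

From the last formula, one sees that any $p$ such that $S(p)<\infty$ is such
that $f\mapsto T^{0,1}(f,p)$ is a linear map which is continuous with respect to the
norm $\|f\|^{0,1}_p=(\langle f,f\rangle^{0,1}_p)^{1/2}$. Hence, Riesz's theorem asserts that there
exists a function $k$ verifying (4.2.12), (4.2.13).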
• Large deviation lower bound. The derivation of the large deviation upper
bound was thus fairly easy. The lower bound is a bit more sophisticated
and relies on the proof of the following points:

(a) The solutions to the heat equations with a smooth drift are unique.

(b) The set described by these solutions is dense in $\mathcal C([0,1],\mathcal P(\mathbb R))$.

(c) The entropy behaves continuously with respect to the approximations 
by elements of this dense set. 

We now describe these ideas more precisely. In the previous section (see
(4.2.15)), we have merely obtained the large deviation upper bound from
the observation that for all $\nu\in\mathcal C([0,1],\mathcal P(\mathbb R))$, all $\delta>0$ and any $f\in C_b^{2,1}(\mathbb R\times[0,1])$,

$$E\Big[1_{\hat\mu^N\in B(\nu,\delta)}\exp\Big(N\Big(T^{0,1}(f,\hat\mu^N)-\frac12\langle f,f\rangle^{0,1}_{\hat\mu^N}\Big)\Big)\Big]\le E\Big[\exp\Big(N\Big(T^{0,1}(f,\hat\mu^N)-\frac12\langle f,f\rangle^{0,1}_{\hat\mu^N}\Big)\Big)\Big]=1.$$
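To make sure that this upper bound is sharp, we need to check that for
any $\nu\in\mathcal C([0,1],\mathcal P(\mathbb R))$ and $\delta>0$, this inequality is almost an equality for
some $k$, i.e. there exists $k\in C_b^{2,1}(\mathbb R\times[0,1])$ such that

$$\liminf_{N\to\infty}\frac1N\log\frac{E\Big[1_{\hat\mu^N\in B(\nu,\delta)}\exp\Big(N\Big(T^{0,1}(k,\hat\mu^N)-\frac12\langle k,k\rangle^{0,1}_{\hat\mu^N}\Big)\Big)\Big]}{E\Big[\exp\Big(N\Big(T^{0,1}(k,\hat\mu^N)-\frac12\langle k,k\rangle^{0,1}_{\hat\mu^N}\Big)\Big)\Big]}\ge0.$$

In other words, we need to find a $k$ such that the probability that $\hat\mu^N$
belongs to a small neighborhood of $\nu$ under the shifted probability measure

$$P^{N,k}=\frac{\exp\Big(N\Big(T^{0,1}(k,\hat\mu^N)-\frac12\langle k,k\rangle^{0,1}_{\hat\mu^N}\Big)\Big)}{E\Big[\exp\Big(N\Big(T^{0,1}(k,\hat\mu^N)-\frac12\langle k,k\rangle^{0,1}_{\hat\mu^N}\Big)\Big)\Big]}\,P$$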

is not too small. In fact, we shall prove that for good processes $\nu$, we can
find $k$ such that this probability goes to one by the following argument.
Take $k\in C_b^{2,1}(\mathbb R\times[0,1])$. Under the shifted probability measure $P^{N,k}$, it
is not hard to see that $\hat\mu^N$ is exponentially tight (indeed, for $k\in C_b^{2,1}(\mathbb R\times[0,1])$,
the density of $P^{N,k}$ with respect to $P$ is uniformly bounded by
$e^{C(k)N}$ with a finite constant $C(k)$ so that $P^{N,k}\circ(\hat\mu^N)^{-1}$ is exponentially
tight since $P\circ(\hat\mu^N)^{-1}$ is). As a consequence, $\hat\mu^N$ is almost surely tight. We
let $\mu_.$ be a limit point. Now, by Itô's calculus, for any $f\in C_b^{2,1}(\mathbb R\times[0,1])$,
any $0\le t\le1$,

$$T^{0,t}(f,\hat\mu^N)=\int_0^t\!\!\int\partial_xf_u(x)\,\partial_xk_u(x)\,d\hat\mu^N_u(x)\,du+M^N_t(f)$$

with a martingale $(M^N_t(f), t\in[0,1])$ with bracket
$(N^{-1}\int_0^t\!\int(\partial_xf_s(x))^2\,d\hat\mu^N_s(x)\,ds,\ t\in[0,1])$. Since the bracket of $M^N_t(f)$ goes
to zero, the martingale $(M^N_t(f), t\in[0,1])$ goes to zero uniformly almost
surely. Hence, any limit point $\mu_.$ must satisfy

$$T^{0,1}(f,\mu)=\int_0^1\!\!\int\partial_xf_u(x)\,\partial_xk_u(x)\,d\mu_u(x)\,du\qquad(4.2.17)$$

for any $f\in C_b^{2,1}(\mathbb R\times[0,1])$.

When $(\mu,k)$ satisfies (4.2.17) for all $f\in C_b^{2,1}(\mathbb R\times[0,1])$, we say that $k$ is
the field associated with $\mu$.

Therefore, if we can prove that there exists a unique solution $\nu_.$ to (4.2.17),
we see that $\hat\mu^N$ converges almost surely under $P^{N,k}$ to this solution. This
proves the lower bound at any measure-valued path $\nu_.$ which is the unique
solution of (4.2.17), namely for any $k\in C_b^{2,1}(\mathbb R\times[0,1])$ such that there
exists a unique solution $\nu_k$ to (4.2.17),
$$\begin{aligned}
\liminf_{\delta\to0}\liminf_{N\to\infty}\frac1N\log W^{\otimes N}\Big(\sup_{t\in[0,1]}d(\hat\mu^N_t,(\nu_k)_t)<\delta\Big)
&=\liminf_{\delta\to0}\liminf_{N\to\infty}\frac1N\log P^{N,k}\Big(1_{\sup_{t\in[0,1]}d(\hat\mu^N_t,(\nu_k)_t)<\delta}\,e^{-NT(k,\hat\mu^N)}\Big)\\
&\ge-T(k,\nu_k)+\liminf_{\delta\to0}\liminf_{N\to\infty}\frac1N\log P^{N,k}\Big(1_{\sup_{t\in[0,1]}d(\hat\mu^N_t,(\nu_k)_t)<\delta}\Big)\\
&\ge-T(k,\nu_k)\qquad(4.2.18)
\end{aligned}$$

where we used in the second line the continuity of $\mu\mapsto T(k,\mu)$ due to our
assumption that $k\in C_b^{2,1}(\mathbb R\times[0,1])$ and the fact that
$P^{N,k}(1_{\sup_{t\in[0,1]}d(\hat\mu^N_t,(\nu_k)_t)<\delta})$ goes to one in the third line. Hence, the question
boils down to uniqueness of the weak solutions of the heat equation
with a drift. This problem is not too difficult to solve here and one can see
that for instance for fields $k$ which are analytic within a neighborhood of
the real line, there is at most one solution to this equation. To generalize
(4.2.18) to any $\nu\in\{S<\infty\}$, it is not hard to see that it is enough to
find, for any such $\nu$, a sequence $\nu_{k_n}$ for which (4.2.18) holds and such that

$$\lim_{n\to\infty}\nu_{k_n}=\nu,\qquad\lim_{n\to\infty}S(\nu_{k_n})=S(\nu).\qquad(4.2.19)$$

Now, observe that $S$ is a convex function so that for any probability measure $p_\epsilon$,

$$S(\mu*p_\epsilon)\le\int S((\cdot-x)_\#\mu)\,p_\epsilon(dx)=S(\mu)\qquad(4.2.20)$$

where in the last step we neglected the condition at the initial time
to say that $S((\cdot-x)_\#\mu)=S(\mu)$ for all $x$. Hence, since $S$ is also lower
semicontinuous, one sees that $S(\mu*p_\epsilon)$ will converge toward $S(\mu)$ for any
$\mu$ with finite entropy $S$. Performing also a regularization with respect to
time and taking care of the initial conditions allows one to construct a sequence
$\nu_n$ with analytic fields satisfying (4.2.19). This point is quite technical but
still manageable in this context. Since it will be done quite explicitly in
the case we are interested in, we shall not detail it here.
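The constant-expectation property behind (4.2.14), on which the whole scheme above rests, can be checked by simulation in the simplest case $m_t=B_t$, $\langle m\rangle_t=t$. The following is a minimal Monte Carlo sketch under that assumption; the function name, sample size and seed are hypothetical choices made for illustration.

```python
import math
import random


def exp_martingale_mean(lam, t, n_samples, seed=0):
    """Monte Carlo estimate of E[exp(lam * B_t - lam**2 * t / 2)].

    B is a standard Brownian motion, so B_t ~ N(0, t) and <B>_t = t;
    the exact value of the expectation is 1 for every lam and t.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        b_t = rng.gauss(0.0, math.sqrt(t))
        total += math.exp(lam * b_t - 0.5 * lam * lam * t)
    return total / n_samples


# The estimate should be close to 1, up to Monte Carlo error.
print(exp_martingale_mean(lam=1.0, t=1.0, n_samples=200_000))
```

The variance of the exponential grows like $e^{\lambda^2t}$, so the estimate degrades quickly for large $\lambda$; this is one reason why the argument works with exact identities rather than with simulations.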

4.2.2. Large deviations for the law of the spectral measure of a
non-centered large dimensional matrix-valued Brownian
motion

To prove a large deviation principle for the law of the spectral measure of
Hermitian Brownian motions, the first natural idea would be, following (4.2.11),
to prove a large deviation principle for the law of the empirical measure
$L^N : t\mapsto N^{-1}\sum_{i=1}^N\delta_{\sqrt{N^{-1}}B^i(t)}$, to use Girsanov's theorem to show that the
law we are considering is absolutely continuous with respect to the law of the
independent Brownian motions with a density which only depends on $L^N$, and
conclude by Laplace's method (cf. Chapter 5). However, this approach presents
difficulties due to the singularity of the interacting potential, and thus of the
density. Here, the techniques developed in |HZ] will however be very efficient
because they only rely on smooth functions of the empirical measure: since the
empirical measures are taken as distributions, the interacting potential
is smoothed by the test functions (note however that this strategy would not
have worked with more singular potentials). According to (4.2.11), we can in
fact follow the very same approach. We here mainly develop the points which
are different.

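Itô's calculus

With the notations of (4.2.7) and (4.2.8), we have

Theorem 4.10 ([29, Lemma 1.1]). 1) When $\beta=2$, for any $N\in\mathbb N$, any
$f\in C_b^{2,1}(\mathbb R\times[0,1])$ and any $s\in[0,1)$, $(S^{s,t}(\hat\mu^N,f),\ s\le t\le1)$ is a bounded
martingale with quadratic variation

$$\langle S^{s,\cdot}(\hat\mu^N,f)\rangle_t=\frac{1}{N^2}\langle f,f\rangle^{s,t}_{\hat\mu^N}.$$

2) When $\beta=1$ or 4, for any $N\in\mathbb N$, any $f\in C_b^{2,1}(\mathbb R\times[0,1])$ and any $s\in[0,1)$,
$\Big(S^{s,t}(\hat\mu^N,f)+\frac1N\Big(\frac12-\frac1\beta\Big)\int_s^t\!\int\partial_x^2f(x,u)\,d\hat\mu^N_u(x)\,du,\ s\le t\le1\Big)$ is a bounded martingale
with quadratic variation

$$\langle S^{s,\cdot}(\hat\mu^N,f)\rangle_t=\frac{2}{\beta N^2}\langle f,f\rangle^{s,t}_{\hat\mu^N}.$$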

Proof: It is easy to derive this result from Itô's calculus and (4.2.11). Let us
however point out how to derive it directly in the case where $f(x)=x^k$ with an
integer number $k\in\mathbb N$ and $\beta=2$. Then, for any $(i,j)\in\{1,\dots,N\}^2$, Itô's calculus
gives

$$\begin{aligned}
d(H^{N,2}(t)^k)_{ij}&=\sum_{l=0}^{k-1}\sum_{p,n=1}^N(H^{N,2}(t)^l)_{ip}\,d(H^{N,2}(t))_{pn}\,(H^{N,2}(t)^{k-l-1})_{nj}\\
&\quad+\frac1N\sum_{l+m=0}^{k-2}\sum_{p,n=1}^N(H^{N,2}(t)^l)_{ip}\,(H^{N,2}(t)^m)_{nn}\,(H^{N,2}(t)^{k-l-m-2})_{pj}\,dt\\
&=\sum_{l=0}^{k-1}(H^{N,2}(t)^l\,dH^{N,2}(t)\,H^{N,2}(t)^{k-l-1})_{ij}\\
&\quad+\sum_{l=0}^{k-2}(k-l-1)\,\mathrm{tr}(H^{N,2}(t)^l)\,(H^{N,2}(t)^{k-l-2})_{ij}\,dt.
\end{aligned}$$

Let us finally compute the martingale bracket of the normalized trace of the
above martingale. We have

$$\Big\langle\int_0^\cdot\sum_{l=0}^{k-1}\mathrm{tr}\big(H^{N,2}(s)^l\,dH^{N,2}(s)\,H^{N,2}(s)^{k-l-1}\big)\Big\rangle_t=\frac{1}{N^2}k^2\,\mathrm{tr}\Big(\int_0^tH^{N,2}(s)^{2(k-1)}\,ds\Big).$$

Similar computations give the bracket of more general polynomial functions.

□ 
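For $k=2$ the Itô formula of the proof reduces to $d\,\mathrm{tr}(H^2)=\text{martingale}+dt$, so that $E[\mathrm{tr}(H(t)^2)]=t$. This can be checked by simulation without computing eigenvalues, since $\mathrm{tr}(H^2)=N^{-1}\sum_{i,j}|H_{ij}|^2$ for a Hermitian matrix. The sketch below assumes the normalization $E|H_{ij}(t)|^2=t/N$ for the Hermitian Brownian motion (the case $\beta=2$); all function names are hypothetical.

```python
import math
import random


def hermitian_bm_sample(n, t, rng):
    """One sample at time t of an n x n Hermitian Brownian motion.

    Assumed normalization (beta = 2): E|H_ij(t)|^2 = t / n for all i, j,
    with real diagonal entries and complex off-diagonal entries.
    """
    h = [[0j] * n for _ in range(n)]
    s = math.sqrt(t / (2 * n))  # std of real and imaginary parts off the diagonal
    for i in range(n):
        h[i][i] = complex(rng.gauss(0.0, math.sqrt(t / n)), 0.0)
        for j in range(i + 1, n):
            z = complex(rng.gauss(0.0, s), rng.gauss(0.0, s))
            h[i][j] = z
            h[j][i] = z.conjugate()
    return h


def normalized_trace_of_square(h):
    """tr(H^2) = (1/n) * sum_{i,j} |H_ij|^2 for a Hermitian matrix H."""
    n = len(h)
    return sum(abs(h[i][j]) ** 2 for i in range(n) for j in range(n)) / n


def mean_trace_of_square(n, t, n_samples, seed=0):
    """Monte Carlo estimate of E[tr(H(t)^2)], expected to be close to t."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        total += normalized_trace_of_square(hermitian_bm_sample(n, t, rng))
    return total / n_samples


print(mean_trace_of_square(n=10, t=1.0, n_samples=500))
```

The fluctuations of $\mathrm{tr}(H(t)^2)$ around $t$ are of order $1/N$, in line with the bracket of order $N^{-2}$ computed in the proof.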

Remark 4.11: Observe that if the entries were not Brownian motions but diffusions
described for instance as solutions of an SDE

$$dx_t=dB_t+U(x_t)\,dt,$$

then the evolution of the spectral measure of the matrix would not be autonomous
anymore. In fact, our strategy is strongly based on the fact that the
variations of the spectral measure under small variations of time depend only
on the spectral measure, allowing us to construct exponential martingales which
are functions of the process of the spectral measure only. It is easy to see that
if the entries of the matrix are not Gaussian, the variations of the spectral measures
will depend on much more general functions of the entries than those of
the spectral measure.

However, this strategy can also be used to study the spectral measure of other
Gaussian matrices as emphasized in [551 ED].

From now on, we shall consider the case where $\beta=2$ and drop the index
2 in $H^{N,2}$, which is slightly easier to write down since there are no error terms
in Itô's formula, but everything extends readily to the cases $\beta=1$ or 4. The
only point to notice is that

$$S_\beta(\nu)=\sup_{f\in C_b^{2,1}(\mathbb R\times[0,1]),\ 0\le s\le t\le1}\Big\{S^{s,t}(\nu,f)-\frac1\beta\langle f,f\rangle^{s,t}_\nu\Big\}=\frac\beta2S_2(\nu)$$

where the last equality is obtained by changing $f$ into $2^{-1}\beta f$.
Large deviation upper bound

From the previous Itô's formula, one can deduce by following the ideas of [SS]
(see Section 4.2.1) a large deviation upper bound for the measure-valued process
$\hat\mu^N\in\mathcal C([0,1],\mathcal P(\mathbb R))$. To this end, we shall make the following assumption on
the initial condition $D_N$:
(H)

$$C_D:=\sup_{N\in\mathbb N}\hat\mu^N_{D_N}(\log(1+|x|^2))<\infty,$$

implying that $(\hat\mu^N_{D_N}, N\in\mathbb N)$ is tight. Moreover, $\hat\mu^N_{D_N}$ converges weakly, as $N$
goes to infinity, toward a probability measure $\mu_D$.

Then, we shall prove, with the notations of (4.2.7)-(4.2.9), the following
Theorem 4.12. Assume (H). Then

(1) $S_{\mu_D}$ is a good rate function on $\mathcal C([0,1],\mathcal P(\mathbb R))$.

(2) For any closed set $F$ of $\mathcal C([0,1],\mathcal P(\mathbb R))$,

$$\limsup_{N\to\infty}\frac{1}{N^2}\log P(\hat\mu^N\in F)\le-\inf_{\nu\in F}S_{\mu_D}(\nu).$$

Proof: We first prove that $S_{\mu_D}$ is a good rate function. Then, we show
that exponential tightness holds and then obtain a weak large deviation upper
bound, these two arguments yielding (2) (cf. Chapter 2).

(a) Let us first observe that $S_{\mu_D}(\nu)$ is also given, when $\nu_0=\mu_D$, by

$$S_{\mu_D}(\nu)=\frac12\sup_{f\in C_b^{2,1}(\mathbb R\times[0,1])}\ \sup_{0\le s\le t\le1}\frac{S^{s,t}(\nu,f)^2}{\langle f,f\rangle^{s,t}_\nu}.\qquad(4.2.21)$$

Consequently, $S_{\mu_D}$ is non negative. Moreover, $S_{\mu_D}$ is obviously lower
semi-continuous as a supremum of continuous functions.

Hence, we merely need to check that its level sets are contained in relatively
compact sets. For $K$ and $C$ compact subsets of $\mathcal P(\mathbb R)$ and $\mathcal C([0,1],\mathbb R)$, respectively,
set

$$\mathcal K(K)=\{\nu\in\mathcal C([0,1],\mathcal P(\mathbb R)),\ \nu_t\in K\ \forall t\in[0,1]\}$$

and

$$\mathcal C(C,f)=\{\nu\in\mathcal C([0,1],\mathcal P(\mathbb R)),\ (t\mapsto\nu_t(f))\in C\}.$$

With $(f_n)_{n\in\mathbb N}$ a family of bounded continuous functions dense in the set $C_c(\mathbb R)$
of compactly supported continuous functions, and $K_M$ and $C_n$ compact subsets
of $\mathcal P(\mathbb R)$ and $\mathcal C([0,1],\mathbb R)$, respectively, recall (see O Section 2.2]) that the sets

$$\mathcal K=\mathcal K(K_M)\cap\bigcap_{n\in\mathbb N}\mathcal C(C_n,f_n)$$

are relatively compact subsets of $\mathcal C([0,1],\mathcal P(\mathbb R))$.

Recall now that $K_L=\cap_n\{\nu\in\mathcal P(\mathbb R) : \nu([-L_n,L_n]^c)\le n^{-1}\}$ (resp. $C_{\delta,M}=\cap_n\{f\in\mathcal C([0,1],\mathbb R) : \sup_{|s-t|\le\delta_n}|f(s)-f(t)|\le n^{-1}\}\cap\{\|f\|_\infty\le M\}$) are compact
subsets of $\mathcal P(\mathbb R)$ (resp. $\mathcal C([0,1],\mathbb R)$) for any choice of sequences $(L_n)_{n\in\mathbb N}\in(\mathbb R^+)^{\mathbb N}$,
$(\delta_n)_{n\in\mathbb N}\in(\mathbb R^+)^{\mathbb N}$ and positive constant $M$. Thus, following the above
description of relatively compact subsets of $\mathcal C([0,1],\mathcal P(\mathbb R))$, to achieve our proof,
it is enough to show that, for any $M>0$,

• 1) For any integer $m$, there is a positive real number $L^M_m$ so that for any
$\nu\in\{S_{\mu_D}\le M\}$,

$$\sup_{0\le s\le1}\nu_s(|x|\ge L^M_m)\le\frac1m\qquad(4.2.22)$$

proving that $\nu_s\in K_{L^M}$ for all $s\in[0,1]$.

• 2) For any integer $m$ and $f\in C_b^2(\mathbb R)$, there exists a positive real number $\delta^M_m$
so that for any $\nu\in\{S_{\mu_D}\le M\}$,

$$\sup_{|t-s|\le\delta^M_m}|\nu_t(f)-\nu_s(f)|\le\frac1m,\qquad(4.2.23)$$

showing that $s\mapsto\nu_s(f)\in C_{\delta^M,\|f\|_\infty}$.

To prove (4.2.22), we consider, for $\delta>0$, $f_\delta(x)=\log(x^2(1+\delta x^2)^{-1}+1)\in C_b^{2,1}(\mathbb R\times[0,1])$.
We observe that

$$C:=\sup_{0<\delta\le1}\|\partial_xf_\delta\|_\infty+\sup_{0<\delta\le1}\|\partial_x^2f_\delta\|_\infty$$

is finite and, for $\delta\in(0,1]$,

$$\Big|\frac{\partial_xf_\delta(x)-\partial_xf_\delta(y)}{x-y}\Big|\le C.$$

Hence, (4.2.21) implies, by taking $f=f_\delta$ in the supremum, that for any $\delta\in(0,1]$,
any $t\in[0,1]$, any $\mu_.\in\{S_{\mu_D}\le M\}$,

$$\mu_t(f_\delta)\le\mu_0(f_\delta)+2Ct+2C\sqrt{Mt}.$$

Consequently, we deduce by the monotone convergence theorem and letting $\delta$
decrease to zero that for any $\mu_.\in\{S_{\mu_D}\le M\}$,

$$\sup_{t\in[0,1]}\mu_t(\log(x^2+1))\le\mu_D(\log(x^2+1))+2C(1+\sqrt M).$$

Chebycheff's inequality and hypothesis (H) thus imply that for any $\mu_.\in\{S_{\mu_D}\le M\}$
and any $K\in\mathbb R^+$,

$$\sup_{t\in[0,1]}\mu_t(|x|\ge K)\le\frac{C_D+2C(1+\sqrt M)}{\log(K^2+1)}$$

which finishes the proof of (4.2.22).

The proof of (4.2.23) again relies on (4.2.21) which implies that for any $f\in C_b^2(\mathbb R)$,
any $\mu_.\in\{S_{\mu_D}\le M\}$ and any $0\le s\le t\le1$,

$$|\mu_t(f)-\mu_s(f)|\le\|\partial_x^2f\|_\infty|t-s|+2\|\partial_xf\|_\infty\sqrt M\sqrt{|t-s|}.\qquad(4.2.24)$$
(b) Exponential tightness

Lemma 4.13. For any integer number $L$, there exists a finite integer number
$N_0\in\mathbb N$ and a compact set $\mathcal K_L$ in $\mathcal C([0,1],\mathcal P(\mathbb R))$ such that $\forall N\ge N_0$,

$$P(\hat\mu^N\in\mathcal K_L^c)\le\exp\{-LN^2\}.$$

Proof: In view of the previous description of the relatively compact subsets
of $\mathcal C([0,1],\mathcal P(\mathbb R))$, we need to show that

• a) For every positive real numbers $L$ and $m$, there is an $N_0\in\mathbb N$ and a
positive real number $M_{L,m}$ so that $\forall N\ge N_0$

$$P\Big(\sup_{0\le t\le1}\hat\mu^N_t(|x|\ge M_{L,m})\ge\frac1m\Big)\le\exp(-LN^2)$$

• b) For any $f\in C_b^2(\mathbb R)$, for any positive real numbers $L$ and $m$, there exists
an $N_0\in\mathbb N$ and a positive real number $\delta_{L,m,f}$ such that $\forall N\ge N_0$

$$P\Big(\sup_{|t-s|\le\delta_{L,m,f}}|\hat\mu^N_t(f)-\hat\mu^N_s(f)|\ge\frac1m\Big)\le\exp(-LN^2)$$

The proof is rather classical (it uses Doob's inequality but otherwise is closely
related to the proof that $S_{\mu_D}$ is a good rate function); we shall omit it here (see
the first section of j^Hl for details).

(c) Weak large deviation upper bound: We here summarize the main arguments
giving the weak large deviation upper bound.

Lemma 4.14. For every process $\nu$ in $\mathcal C([0,1],\mathcal P(\mathbb R))$, if $B_\delta(\nu)$ denotes the open
ball with center $\nu$ and radius $\delta$ for the distance $D$, then

$$\lim_{\delta\to0}\limsup_{N\to\infty}\frac{1}{N^2}\log P(\hat\mu^N\in B_\delta(\nu))\le-S_{\mu_D}(\nu).$$

The arguments are exactly the same as in Section 4.2.1.



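Large deviation lower bound

We shall prove at the end of this section that

Lemma 4.15. Let

$$\mathcal{MF}_\infty=\Big\{h\in C_b^{\infty,1}(\mathbb R\times[0,1])\cap\mathcal C([0,1],L^2(\mathbb R));\ \exists(C,\epsilon)\in(0,\infty)^2;\ \sup_{t\in[0,1]}|\hat h_t(\lambda)|\le Ce^{-\epsilon|\lambda|}\Big\}$$

where $\hat h_t$ stands for the Fourier transform of $h_t$. Then, for any field $k$ in $\mathcal{MF}_\infty$,
there exists a unique solution $\nu_k$ to

$$S^{s,t}(\nu,f)=\langle f,k\rangle^{s,t}_\nu\qquad(4.2.25)$$

for any $f\in C_b^{2,1}(\mathbb R\times[0,1])$. We set $\mathcal{MC}([0,1],\mathcal P(\mathbb R))$ to be the subset of
$\mathcal C([0,1],\mathcal P(\mathbb R))$ consisting of such solutions.

Note that $h$ belongs to $\mathcal{MF}_\infty$ iff it can be extended analytically to $\{z : |\Im(z)|<\epsilon\}$.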
As a consequence of Lemma 4.15, we find that for any open subset $O\subset\mathcal C([0,1],\mathcal P(\mathbb R))$,
any $\nu\in O\cap\mathcal{MC}([0,1],\mathcal P(\mathbb R))$, there exists $\delta>0$ small enough
so that

$$\begin{aligned}
P(\hat\mu^N\in O)&\ge P(d(\hat\mu^N,\nu)<\delta)\\
&\ge e^{-N^2(S^{0,1}(\nu,k)-\frac12\langle k,k\rangle^{0,1}_\nu)-g(\delta)N^2}\,P^{N,k}(d(\hat\mu^N,\nu)<\delta)
\end{aligned}$$

with a function $g$ going to zero at zero. Hence, for any $\nu\in O\cap\mathcal{MC}([0,1],\mathcal P(\mathbb R))$,

$$\liminf_{N\to\infty}\frac{1}{N^2}\log P(\hat\mu^N\in O)\ge-\Big(S^{0,1}(\nu,k)-\frac12\langle k,k\rangle^{0,1}_\nu\Big)=-S_{\mu_D}(\nu)$$

and therefore

$$\liminf_{N\to\infty}\frac{1}{N^2}\log P(\hat\mu^N\in O)\ge-\inf_{O\cap\mathcal{MC}([0,1],\mathcal P(\mathbb R))}S_{\mu_D}.\qquad(4.2.26)$$

To complete the lower bound, it is therefore sufficient to prove that for any
$\nu\in\mathcal C([0,1],\mathcal P(\mathbb R))$, there exists a sequence $\nu^n\in\mathcal{MC}([0,1],\mathcal P(\mathbb R))$ such that

$$\lim_{n\to\infty}\nu^n=\nu\quad\text{and}\quad\lim_{n\to\infty}S_{\mu_D}(\nu^n)=S_{\mu_D}(\nu).\qquad(4.2.27)$$

The rate function $S_{\mu_D}$ is not convex a priori since it is the supremum of
quadratic functions of the measure-valued path $\nu$, so that there is no reason
why it should be reduced by standard convolution as in the classical setting
(cf. Section 4.2.1). Thus, it is now unclear how we can construct the sequence
(4.2.27). Further, we begin with a degenerate rate function which is infinite
unless $\nu_0=\mu_D$.

To overcome the lack of convexity, we shall remember the origin of the prob- 
lem; in fact, we have been considering the spectral measure of matrices and 
should not forget the special features of operators due to the matrices structure. 
By definition, the differential equation satisfied by a Hermitian Brownian motion 
should be invariant if we translate the entries, that is translate the Hermitian 
Brownian motion by a self-adjoint matrix. The natural limiting framework of 
large random matrices is free probability, and the limiting spectral measure of 
the sum of a Hermitian Brownian motion and a deterministic self-adjoint ma- 
trix converges toward the free convolution of their respective limiting spectral 
measure. Intuitively, we shall therefore expect (and in fact we will show) that
the rate function $S^{0,1}$ decreases by free convolution, generalizing the fact that
standard convolution was decreasing the Brownian motion rate function (cf.
(4.2.20)). However, because free convolution by a Cauchy law is equal to the
standard convolution by a Cauchy law, we shall regularize our laws by convolution
by Cauchy laws. Free probability shall be developed in a later chapter.
Let us here outline the main steps of the proof of (4.2.27):
1. We find that convolution by Cauchy laws $(P_\epsilon)_{\epsilon>0}$ decreases the entropy
and prove (this is very technical and proved in [65]) that for any $\nu\in\{S_{\mu_D}<\infty\}$,
any given partition $0=t_1<t_2<\dots<t_n=1$ with
$t_i=(i-1)\Delta$, the measure-valued path given, for $t\in[t_k,t_{k+1}[$, by

$$\nu^{\epsilon,\Delta}_t=P_\epsilon*\nu_{t_k}+\frac{t-t_k}{\Delta}\,[P_\epsilon*\nu_{t_{k+1}}-P_\epsilon*\nu_{t_k}]$$

satisfies

$$\lim_{\epsilon\to0}\lim_{\Delta\to0}S^{0,1}(\nu^{\epsilon,\Delta})=S^{0,1}(\nu),\qquad\lim_{\epsilon\to0}\lim_{\Delta\to0}\nu^{\epsilon,\Delta}=\nu.$$

2. We prove that for $\nu\in\mathcal A$,

$$\mathcal A=\Big\{\nu\in\mathcal C([0,1],\mathcal P(\mathbb R)) : \exists\epsilon>0;\ \sup_{t\in[0,1]}\nu_t(|x|^{5+\epsilon})<\infty\Big\},\qquad(4.2.28)$$

$\nu^{\epsilon,\Delta}\in\mathcal{MC}([0,1],\mathcal P(\mathbb R))$ (actually a slightly weaker form since the field
$h^{\epsilon,\Delta}$ may have time discontinuities at the points $\{t_1,\dots,t_n\}$, which does
not affect the uniqueness statement of Lemma 4.15).
Here, the choice of Cauchy laws is not innocent; it is a good choice because
free convolution by Cauchy laws is, exceptionally, the standard convolution.
Hence, it is easier to work with. Moreover, to prove the above result,
it is convenient that $\nu^{\epsilon,\Delta}_t$ is absolutely continuous with respect to Lebesgue
measure with non vanishing density (see (4.2.31)). But it is not hard to
see that for any measure $\mu$ with compact support, $\mu\boxplus p$, if it has a density
with respect to Lebesgue measure for many choices of laws $p$, is likely to
have holes in its density unless $\int x\,dp(x)=+\infty$. For these two reasons,
the Cauchy law is a wise choice. However, we have to pay for it as we shall
see below.

3. Everything looks nice except that we modified the initial condition from
$\mu_D$ into $\mu_D*P_\epsilon$, so that in fact $S_{\mu_D}(\nu^{\epsilon,\Delta})=+\infty$! Moreover, the
empirical measure-valued process cannot deviate toward processes of the
form $\nu^{\epsilon,\Delta}$ even after some time because these processes do not have finite
second moment. To overcome this problem, we first note that this result
will still give us a large deviation lower bound if we change the initial data
of our matrices. Namely, let, for $\epsilon>0$, $C^N_\epsilon$ be an $N\times N$ diagonal matrix
with spectral measure converging toward the Cauchy law $P_\epsilon$ and consider
the matrix-valued process

$$X^{N,\epsilon}_t=U_NC^N_\epsilon U_N^*+D_N+H^N(t)$$

with $U_N$ an $N\times N$ unitary matrix following the Haar measure on
$U(N)$. Then, it is well known (see Voiculescu [122]) that the spectral
distribution of $U_NC^N_\epsilon U_N^*+D_N$ converges toward the free convolution
$P_\epsilon\boxplus\mu_D=P_\epsilon*\mu_D$.

Hence, we can proceed as before to obtain the following large deviation
estimates on the law of the spectral measure $\hat\mu^{N,\epsilon}_t=\hat\mu^N_{X^{N,\epsilon}_t}$.
Corollary 4.16. For any $\epsilon>0$, for any closed subset $F$ of $\mathcal C([0,1],\mathcal P(\mathbb R))$,

$$\limsup_{N\to\infty}\frac{1}{N^2}\log P(\hat\mu^{N,\epsilon}\in F)\le-\inf\{S_{P_\epsilon*\mu_D}(\nu),\ \nu\in F\}.$$

Further, for any open set $O$ of $\mathcal C([0,1],\mathcal P(\mathbb R))$,

$$\liminf_{N\to\infty}\frac{1}{N^2}\log P(\hat\mu^{N,\epsilon}\in O)\ge-\inf\{S_{P_\epsilon*\mu_D}(\nu),\ \nu\in O,\ \nu=P_\epsilon*\mu,\ \mu\in\mathcal A\cap\{S_{\mu_D}<\infty\}\}.$$

4. To deduce our result for the case $\epsilon=0$, we proceed by exponential
approximation. In fact, we have the following lemma, whose proof is fairly
classical and omitted here (cf. proof of Lemma 2.11).

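Lemma 4.17. Consider, for $L\in\mathbb R^+$, the compact set $K_L$ of $\mathcal P(\mathbb R)$ given
by

$$K_L=\{\mu\in\mathcal P(\mathbb R);\ \mu(\log(x^2+1))\le L\}.$$

Then, on $\mathcal K^N_\epsilon(K_L):=\bigcap_{t\in[0,1]}(\{\hat\mu^{N,\epsilon}_t\in K_L\}\cap\{\hat\mu^N_t\in K_L\})$,

$$D(\hat\mu^{N,\epsilon},\hat\mu^N)\le f(N,\epsilon)$$

where

$$\limsup_{\epsilon\to0}\limsup_{N\to\infty}f(N,\epsilon)=0.$$

We then can prove that

Theorem 4.18. Assume that $\hat\mu^N_{D_N}$ converges toward $\mu_D$ while

$$\sup_{N\in\mathbb N}\hat\mu^N_{D_N}(x^2)<\infty.$$

Then, for any $\mu_.\in\mathcal A$

$$\lim_{\delta\to0}\liminf_{N\to\infty}\frac{1}{N^2}\log P(D(\hat\mu^N,\mu_.)\le\delta)\ge-S_{\mu_D}(\mu_.)$$

so that for any open subset $O\subset\mathcal C([0,1],\mathcal P(\mathbb R))$,

$$\liminf_{N\to\infty}\frac{1}{N^2}\log P(\hat\mu^N\in O)\ge-\inf_{O\cap\mathcal A}S_{\mu_D}.$$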
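Proof of Theorem 4.18: Following Lemma 4.13, we deduce that for
any $M\in\mathbb R^+$, we can find $L_M\in\mathbb R^+$ such that for any $L\ge L_M$,

$$\sup_{0\le\epsilon\le1}P(\mathcal K^N_\epsilon(K_L)^c)\le e^{-MN^2}.\qquad(4.2.29)$$

Fix $M>S_{\mu_D}(\mu)+1$ and $L\ge L_M$. Let $\delta>0$ be given. Next, observe
that $P_\epsilon*\mu_.$ converges weakly toward $\mu_.$ as $\epsilon$ goes to zero and choose
consequently $\epsilon$ small enough so that $D(P_\epsilon*\mu_.,\mu_.)<\frac\delta3$. Then, write

$$\begin{aligned}
P(\hat\mu^N\in B(\mu_.,\delta))&\ge P\Big(D(\hat\mu^N,\hat\mu^{N,\epsilon})<\frac\delta3,\ \hat\mu^{N,\epsilon}\in B\Big(P_\epsilon*\mu_.,\frac\delta3\Big),\ \mathcal K^N_\epsilon(K_L)\Big)\\
&\ge P\Big(\hat\mu^{N,\epsilon}\in B\Big(P_\epsilon*\mu_.,\frac\delta3\Big)\Big)-P(\mathcal K^N_\epsilon(K_L)^c)\\
&\quad-P\Big(D(\hat\mu^{N,\epsilon},\hat\mu^N)\ge\frac\delta3,\ \mathcal K^N_\epsilon(K_L)\Big)=I-II-III.
\end{aligned}$$

(4.2.29) implies, up to terms of smaller order, that

$$II\le e^{-N^2(S_{\mu_D}(\mu)+1)}.$$

Lemma 4.17 shows that $III=0$ for $\epsilon$ small enough and $N$ large, while
Corollary 4.16 implies that for any $\eta>0$, $N$ large and $\epsilon>0$

$$I\ge e^{-N^2S_{P_\epsilon*\mu_D}(P_\epsilon*\mu_.)-N^2\eta}\ge e^{-N^2S_{\mu_D}(\mu_.)-N^2\eta}.$$

Theorem 4.18 is proved. □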
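5. To complete the lower bound, we need to prove that for any $\nu\in\{S_{\mu_D}<\infty\}$,
there exists a sequence of $\nu^n\in\mathcal A$ such that

$$\lim_{n\to\infty}S_{\mu_D}(\nu^n)=S_{\mu_D}(\nu),\qquad\lim_{n\to\infty}\nu^n=\nu.\qquad(4.2.30)$$

This is done by the following approximation: we let

$$\nu^{\eta,\epsilon}_t(dx)=\begin{cases}\mu_D\boxplus\sigma_t,&\text{if }t\le\eta,\\ [\mu_D\boxplus\sigma_\eta]^{t-\eta},&\text{if }\eta\le t\le\eta+\epsilon,\\ [\sigma_\eta\boxplus\nu_{t-\eta-\epsilon}]^{\epsilon},&\text{if }t\ge\eta+\epsilon,\end{cases}$$

where, for $\mu\in\mathcal P(\mathbb R)$, $[\mu]^\gamma$ is the probability measure given for $f\in C_b(\mathbb R)$
by

$$[\mu]^\gamma(f):=\int f((1+\gamma x^2)^{-a}x)\,d\mu(x)$$

for some $a<\frac12$. Then, we show that we can choose $(\eta_n,\epsilon_n)_{n\in\mathbb N}$ so that
$\nu^n=\nu^{\eta_n,\epsilon_n}$ satisfies (4.2.30). Moreover $\nu^n\in\mathcal A$ for $5(1-2a)<2$ because
$\sup_{t\in[0,1]}\nu_t(x^2)<\infty$ as $S_{\mu_D}(\nu)<\infty$, $\sigma_t$ is compactly supported and we
assumed $\mu_D(|x|^{5+\epsilon})<\infty$.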
6. To finish the proof, we need to complete Lemmas 4.15 and 4.17 and the points
1) and 2) (see (4.2.22), (4.2.23)) of our program. We prove below Lemma
4.15. We provide in Chapter 6 part of the proofs
of 1), 2).

In fact we show in Chapter that S°^(v e ' A ) converges toward S 0,1 (v). 
The fact that the field h e ' A associated with z/ £,A satisfies the necessary 
conditions so that v € > A € MC([0, 1],V(]R)) is proved in [HHI • We shall not 
detail it here but let us just point out the basic idea, which is to observe
that (4.2.25) is equivalent to writing that, if $\nu_t(dx) = \rho_t(x)dx$ with a smooth
density $\rho_.$,

$$\partial_t\rho_t(x) = -\partial_x\big(\rho_t(x)H\rho_t(x) + \rho_t(x)\partial_x k_t(x)\big),$$

where $H\nu$ is the Hilbert transform

$$H\nu(x) = PV\int\frac{1}{x-y}\,d\nu(y) = \lim_{\epsilon\to 0}\int\frac{x-y}{(x-y)^2+\epsilon^2}\,d\nu(y).$$

In other words,

$$\partial_x k_t(x) = -\frac{\int_{-\infty}^{x}\partial_t\rho_t(y)\,dy}{\rho_t(x)} - H\rho_t(x). \qquad (4.2.31)$$

Hence, we see that $\partial_x k_.$ is smooth as soon as $\rho$ is smooth, its Hilbert transform
behaves well and $\rho$ does not vanish. To study the Fourier transform
of $\partial_x k_t$, we need it to belong to $L^1(dx)$, which we can only show when
the original process $\nu$ possesses at least a finite fifth moment. More details are
given in [HSI -
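The regularised formula for the Hilbert transform above can be checked numerically. The following is a minimal sketch, assuming the semicircular law on $[-2,2]$ (variance one) as test measure, for which the standard closed form $H\sigma(x) = x/2$ holds inside the support; the function names, the grid size and the value of the regularisation parameter are illustrative choices, not taken from the text.

```python
import math


def semicircle_density(y):
    """Density of the semicircular law on [-2, 2] (variance 1)."""
    return math.sqrt(max(4.0 - y * y, 0.0)) / (2.0 * math.pi)


def hilbert_transform(density, x, eps=1e-2, lo=-2.0, hi=2.0, n=200_000):
    """Approximate H nu(x) by integrating the regularised kernel
    (x - y) / ((x - y)^2 + eps^2) against density(y) dy (midpoint rule)."""
    h = (hi - lo) / n
    total = 0.0
    for i in range(n):
        y = lo + (i + 0.5) * h
        d = x - y
        total += d / (d * d + eps * eps) * density(y)
    return total * h


if __name__ == "__main__":
    # Compare with the closed form x / 2, valid for |x| <= 2.
    for x in (-1.0, 0.0, 0.5, 1.5):
        print(x, hilbert_transform(semicircle_density, x), x / 2.0)
```

The agreement is only up to an error of order `eps`, since the limit in the definition is replaced by a fixed small regularisation.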



Proof of Lemma T4. 151 : Following |2§], we take f(x, t) := e iXx in l|4.2.25|l and 

denote by £ t (A) = / e iXx dv t (x) the Fourier transform of v t . v e MC([0, 1],V(M)) 
implies that if k is the field associated with v, |fct(A)| < Ce _e ' A ' with a given 
e > 0. Then, we find that for t e [0 = t 1 ,t 2 ], 

£ t (A) = Co(\)~ f f C s {aX)C s {{l-a)X)dads+iX [ f C s (X+X')k(X' ,s)dX'ds. 
2 Jo Jo Jo J 

(4.2.32) 

Multiplying both sides of this equality by e~i' A ' gives, with £|(A) = e~i' A '£t(A), 

£t(X) = r§(A)-^ f f 1 Cl(aX)Ct((l~a)X)dad S 
z Jo Jo 

+ ixj^ J £ e s (X + X , )e^ x+x '\-^k(X',s)dX / ds. (4.2.33) 

Therefore, if $\nu$, $\nu'$ are two solutions with Fourier transforms $\mathcal L$ and $\mathcal L'$ respectively
and if we set $\Delta^\epsilon_t(\lambda) = |\mathcal L^\epsilon_t(\lambda) - \mathcal L'^\epsilon_t(\lambda)|$, we deduce from (4.2.33) (see J^J, proof
of Lemma 2.6, for details) that

Af(A) < X 2 J A £ s (aX)e-^ 1 - a)eX dads 

+C\X\ J* J A e s (X + A')e* |A+V| -* |X| - e|V| iiA / da 
< (C+-)|A| / sup A e s (X')ds + 3t\X\e-^ 

£ JO |A'|<|A| 

where we used that A|(A) < 2e~t |A| . Considering A e t (R) = supi A /|< H A* (A'), 
we therefore obtain 

A t e (i?) < (C + ~)R f A e s (R)ds + MRe-* R . 
e Jo 

By Gronwall's lemma, we deduce that 

At(R) < 3Re~* R e( c+ ^ Rt 

and thus that A^ (oo) = for t < r = j^^q)- By induction over the time, we 
conclude that A|(oo) = for any time t < and then any time t < 1, and 
therefore that v = v. 

□ 

4.3. Discussion and open problems 

We have seen in this section how the non-intersecting paths description (or
equivalently the Dyson equations (4.2.11)) of spherical integrals can be used to
study their first order asymptotic behaviour and understand where the integral
concentrates.

A related question is to study the law of the spectral measure of the random
matrix $M$ with law

$$d\mu^N(M) = \frac{1}{Z_N}\, e^{-N\,\mathrm{Tr}(V(M)) + N\,\mathrm{Tr}(AM)}\, dM.$$

In the case where $V(M) = \frac12 M^2$, $M = W + A$ with $W$ a Wigner matrix,
the asymptotic distribution of the eigenvalues is given by the free convolution
of the semi-circular distribution and the limiting spectral distribution of $A$.
A more detailed study based on Riemann-Hilbert techniques gives the limiting
eigenvalue distribution correlations when $\hat\mu^N_A = \alpha_N\delta_a + (1 - \alpha_N)\delta_{-a}$ (cf. |19ll3"]).
It is natural to wonder whether such a result could be derived from the Brownian
paths description of the matrix. Our result allows one (as a mild generalization of the
next chapter) to describe the limiting spectral measure for general potentials $V$.
However, the study of correlations requires different sets of techniques.
It would be very interesting to understand the relation between this limit
and the expansion obtained by B. Collins who proved that

lim "at2 iffi (^ E N > Djy ) | \—o = a p (p D ,fi E ) 

N — too iV 

with some complicated functions $a_p(\mu_D, \mu_E)$ of interest. The physicists' answer
to such questions is that

a-pifJ-D, M-e) = d\ I™ tt? lo g^5v (^ e n, -D/v)U=o := a p (fi D , jj, E ) 

N—>oc iV 

which would validate the whole approach of using the asymptotics of $I_N^{(2)}$ to
compute and study the $a_p(\mu_D, \mu_E)$'s. However, it is not yet known whether
such an interchange of limit and derivation is rigorous. A related topic is to
understand whether the asymptotics we obtained extend to non-Hermitian
matrices. This is not true in general, following a counterexample of S. Zelditch
[133], but still could hold when the spectral norm of the matrices is small enough.
In the case where one matrix has rank one, I have shown with M. Maida [B3"|
that such an analytic extension was true.

Other models such as domino tilings can be represented by non-intersecting 
paths (cf. [Sj for instance). It is rather tempting to hope that similar techniques 
could be used in this setting and give a second approach to [SI]. This however 
seems slightly more involved because time and space are discrete and have the 
same scaling (making the approximation by Brownian motions irrelevant), so 
that the whole machinery borrowed from hydrodynamics technology does not 
seem to be well adapted. 

It would also be very interesting to obtain second order correction terms
for the spherical integrals, a problem related to the enumeration of maps with
genus $g \ge 1$, as we shall see in the next section.

Finally, it would also be nice to get a better understanding of the limiting
value of the spherical integrals, namely of $J_\beta(\mu_D, \mu_E)$. We shall give some
elements in this direction in the next section but wish to emphasize already
that this quantity remains rather mysterious. We have not yet been able, for
instance, to obtain a simple formula in the case of Bernoulli measures (which
are somewhat degenerate cases in this context).



Chapter 5 



Matrix models and 
enumeration of maps 



It has been known since the work of 't Hooft that matrix integrals can be seen, via
Feynman diagram expansions, as generating functions for enumerating maps
(or triangulated surfaces). We refer here to the very nice survey of A. Zvonkin [136].
One-matrix integrals are used to enumerate maps with a given genus
and given vertex degree distribution, whereas several-matrix integrals can
be used to consider the case where the vertices can additionally be colored (i.e.
can take different states).

Matrix integrals are usually of the form

$$Z_N(P) = \int e^{-N\,\mathrm{Tr}(P(A_1,\ldots,A_d))}\, dA_1\cdots dA_d$$

with some polynomial function $P$ of $d$ non-commutative variables and the Lebesgue
measure $dA$ on some well chosen ensemble of $N \times N$ matrices such as the set $\mathcal H_N$
(resp. $\mathcal S_N$, resp. $\mathrm{Symp}_N$) of $N \times N$ Hermitian (resp. symmetric, resp. symplectic)
matrices.

We shall describe in the next section how such integrals are related with the 
enumeration of maps. Then, we shall apply the results of the previous section 
to compute the first order asymptotics of such integrals in some cases where the 
polynomial function P has a quadratic interaction. 

5.1. Relation with the enumeration of maps 

Following A. Zvonkin's |136j . let us consider the case where d = 1, P(x) — 
tx A + jx 2 and integration holds over H.n. Then, it is argued that 
Conjecture 5.1 [see A. Zvonkin [136]]: For any $N \in \mathbb N$, any $t \in \mathbb R$,
Fig 5.1. Surfaces of genus g = 0, 2



with

$$C(n,g) = \#\{\text{maps of genus } g \text{ with } n \text{ vertices of degree } 4\}.$$

Here, a map of genus $g$ is an oriented connected graph drawn on a surface of
genus $g$, modulo equivalence classes.

To be more precise, a surface is a compact oriented two-dimensional manifold
without boundary. The surfaces are classified according to their genus
(the number of 'handles', see Figure 5.1). There is
only one surface with a given genus up to homeomorphism. Following
Definition 4.1, a map is then a graph which is 'drawn' on (or embedded into) a
surface in such a way that the edges do not intersect and, if we cut the surface
along the edges, we get a disjoint union of sets which are homeomorphic to an
open disk (these sets are the faces of the map). Note that in Conjecture 5.1 the
maps are counted up to homeomorphisms (that is, modulo equivalence classes).

The formal proof of Conjecture 5.1 goes as follows. One begins by expanding
all the non quadratic terms in the exponential




n>0 



with $\mu_N$ the Gaussian law



f i N {dA) = Zjv(orV 



fTr(^ )(M 



that is the law of Wigner's Hermitian matrices given by

$$A_{ij} = \bar A_{ji}, \qquad A_{ij} = \mathcal N(0, N^{-1}), \quad 1 \le i \le j \le N.$$



Writing 
and using Wick's formula to compute the expectation over the Gaussian variables,
gives the result.

One can alternatively use the graphical representation introduced by Feynman
to compute such expectations. It goes as shown in Figure 5.2.

The last equivalence in Figure 5.2 results from the observation that $\mu_N(A_{ij}) = 0$
for all $i,j$ and that $\mu_N(A_{ij}A_{kl}) = \delta_{il}\delta_{jk}N^{-1}$, so that each end of the crosses
has to be connected with another one. As a consequence of this construction,
one can see that only oriented fat graphs will contribute to the sum. Moreover,
since on each face of the graph the indices are constant, we see that each given
graph will appear $N^{\#\text{faces}}$ times. Hence, if we denote

G(n, F) = {oriented graphs with n vertices with degree 4 and F faces} 

we obtain 

n>0 F>0 

Taking logarithm, it is well known that we get only contributions from connected 
graphs : 

log Z N (tA 4 ) = J^h^ G ^ F ) n connected graphs }. 

n>0 F>0 

Finally, since the genus of a map is related with its number of faces by n — F = 
2(g — 1), the result is proved. When considering several matrices, we see that 
we have additionally to decide at each vertex which matrix contributed, which 
corresponds to give different states to the vertices. 
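The counting argument above can be made concrete on small cases. The sketch below is an illustration under stated assumptions, not the enumeration used in the text: it labels the $4n$ half-edges of $n$ vertices of degree 4, enumerates all Wick pairings, and counts the faces of the resulting fat graph as the cycles of the permutation "pair, then rotate around the vertex". Pairings are counted with labelled half-edges and include disconnected graphs, so the numbers differ from $C(n,g)$ by symmetry factors. The names `pairings`, `count_faces` and `face_histogram` are hypothetical.

```python
from collections import Counter


def pairings(items):
    """Yield every perfect matching of the list `items` (Wick pairings)."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for i, partner in enumerate(rest):
        remaining = rest[:i] + rest[i + 1:]
        for tail in pairings(remaining):
            yield [(first, partner)] + tail


def count_faces(n, pairing):
    """Number of faces of the fat graph with n vertices of degree 4.

    Half-edges 4k, 4k+1, 4k+2, 4k+3 sit in cyclic order around vertex k.
    Faces are the cycles of h -> sigma(alpha(h)), where alpha is the pairing
    and sigma the rotation to the next half-edge around the same vertex.
    """
    m = 4 * n
    sigma = [4 * (h // 4) + (h % 4 + 1) % 4 for h in range(m)]
    alpha = [0] * m
    for a, b in pairing:
        alpha[a], alpha[b] = b, a
    seen = [False] * m
    faces = 0
    for start in range(m):
        if seen[start]:
            continue
        faces += 1
        h = start
        while not seen[h]:
            seen[h] = True
            h = sigma[alpha[h]]
    return faces


def face_histogram(n):
    """Map F -> number of pairings of n degree-4 vertices with F faces."""
    all_pairings = pairings(list(range(4 * n)))
    return dict(Counter(count_faces(n, p) for p in all_pairings))


if __name__ == "__main__":
    print(face_histogram(1))
    print(face_histogram(2))
```

For a single vertex there are three pairings: two have three faces (genus 0, since $n - F = 2(g-1)$ gives $g = 0$) and one has a single face (genus 1). With the covariance $N^{-1}$ per edge and a factor $N$ per face, this matches the classical value $\mu_N(N^{-1}\mathrm{Tr}(A^4)) = 2 + N^{-2}$.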

Of course, this derivation is formal and it is not clear at all that such an
expansion of the free energy exists. Indeed, the series here might well not be
summable (this is clear when $t < 0$ since the integral diverges). In the one matrix
case, this result was proved very recently by J. McLaughlin and N. Ercolani,
who have shown that such an expansion is valid by means of Riemann-Hilbert
problems techniques, under natural assumptions over the potential. The first
step in order to use Riemann-Hilbert problems techniques is to understand
the limiting behavior of the spectral measures of the matrices following the
corresponding Gibbs measure

$$\mu^N_P(dA_1, \ldots, dA_d) = \frac{1}{Z_N(P)}\, e^{-N\,\mathrm{Tr}(P(A_1,\ldots,A_d))}\, dA_1 \cdots dA_d.$$

This is clearly needed to localize the integral. The understanding of such asymp- 
totics is the subject of the next section. 

Let us remark before embarking on this line of attack that a more direct
combinatorial strategy can be developed. For instance, G. Schaeffer and M.
Bousquet-Mélou [22] studied the Ising model on planar random graphs as a
generating function for the enumeration of colored planar maps, generalizing
Tutte's approach. The results are then more explicitly described by an algebraic
equation for this generating function. However, the matrix models approach is more
general a priori since it allows one to consider many different models and arbitrary
genus, even though the mathematical understanding of these points is still far
from being achieved.

We can represent $A_{i_1i_2}A_{i_2i_3}A_{i_3i_4}A_{i_4i_1}$ by an oriented cross.
Therefore, $\prod_{k=1}^n A_{i_1^k i_2^k}A_{i_2^k i_3^k}A_{i_3^k i_4^k}A_{i_4^k i_1^k}$ is represented by $n$ such crosses.
Since $\mu_N(A_{ij}) = 0$, $\mu_N(A_{ij}A_{kl}) = N^{-1}\delta_{il}\delta_{jk}$, the expectation of this product
will be null except if the indices form an oriented diagram.

Fig 5.2. Computing Gaussian integrals

Finally, let us notice that the relation between the enumeration of maps and 
the matrix models should a priori be true only when the weights of non quadratic 
Gaussian terms are small (the expansion being in fact a formal analytic expan- 
sion around the origin) but that matrix integrals are of interest in other regimes 
for instance in free probability. 

5.2. Asymptotics of some matrix integrals 

We would like to consider integrals of more than one matrix. The simplest
interaction that one can think of is the quadratic one. Such an interaction describes
already several classical models in random matrix theory; we refer here to the
works of M. Mehta, A. Matytsin, A. Migdal, V. Kazakov, P. Zinn-Justin and B.
Eynard for instance.

• The random Ising model on random graphs is described by the Gibbs
measure

$$\mu^N_{Ising}(dA, dB) = \frac{1}{Z^N_{Ising}}\, e^{N\mathrm{Tr}(AB) - N\mathrm{Tr}(P_1(A)) - N\mathrm{Tr}(P_2(B))}\, dA\, dB$$

with $Z^N_{Ising}$ the partition function

$$Z^N_{Ising} = \int e^{N\mathrm{Tr}(AB) - N\mathrm{Tr}(P_1(A)) - N\mathrm{Tr}(P_2(B))}\, dA\, dB$$

and two polynomial functions $P_1$, $P_2$. The limiting free energy for this
model was calculated by M. Mehta [53] in the case $P_1(x) = P_2(x) = x^2 + gx^4$
and integration holds over $\mathcal H_N$. However, the limiting spectral
measures of $A$ and $B$ under $\mu^N_{Ising}$ were not considered in that paper.
A discussion about this problem can be found in P. Zinn-Justin [135].

• One can also define the $q-1$ Potts model on random graphs described by
the Gibbs measure

$$\mu^N_{Potts}(dA_1, \ldots, dA_q) = \frac{1}{Z^N_{Potts}} \prod_{i=2}^{q} e^{N\mathrm{Tr}(A_1A_i) - N\mathrm{Tr}(P_i(A_i))}\, dA_i\; e^{-N\mathrm{Tr}(P_1(A_1))}\, dA_1.$$

The limiting spectral measures of $(A_1, \ldots, A_q)$ are discussed in [135] when
$P_i = gx^3 - x^2$ (!).

• As a straightforward generalization, one can consider matrices coupled by
a chain following S. Chadha, G. Mahoux and M. Mehta [§31, given by

$$\mu^N_{chain}(dA_1, \ldots, dA_q) = \frac{1}{Z^N_{chain}} \prod_{i=2}^{q} e^{N\mathrm{Tr}(A_{i-1}A_i) - N\mathrm{Tr}(P_i(A_i))}\, dA_i\; e^{-N\mathrm{Tr}(P_1(A_1))}\, dA_1.$$

$q$ can eventually go to infinity as in [92].

The first order asymptotics of these models can be studied thanks to the control 
of spherical integrals obtained in the last chapter. 

Theorem 5.2. Assume that $P_i(x) \ge c_i x^4 + d_i$ with $c_i > 0$ and some finite
constants $d_i$. Hereafter, $\beta = 1$ (resp. $\beta = 2$, resp. $\beta = 4$) when $dA$ denotes
the Lebesgue measure on Sn (resp. TLn , resp. TLjq with N even). Then, with 
c = mf„ 6 -p(E) Ip{v), 



F Ismg = \in^-L\ ogZ ? smg 



w£{n(P) + v{Q) - iWQx, v) - - |e(i/)} - 2(5.2.1) 



Fp tts = Jim -^logZg ott 



1 

m{^2 m(Pi) - E /(/5) (^< - § E - « c ( 5 - 2 - 2 ) 

i=l i=2 i=l 

- inf{E Mi (P0 - E I m 0*-UIH) ~ f E - 45-2.3) 

i=l i=2 i=l 



Remark 5.3: The above theorem actually extends to polynomial functions
going to infinity like $x^2$. However, the case of quadratic polynomials is trivial
since it boils down to the Gaussian case, and therefore the next interesting case
is that of quartic polynomials as above. Moreover, Theorem 5.4 fails in the case where
$P$, $Q$ go to infinity only like $x^2$. However, all our proofs would extend easily to
any continuous functions $P_i$'s such that $P_i(x) \ge a|x|^{2+\epsilon} + b$ with some $a > 0$ and
$\epsilon > 0$. In particular, we do not need any analyticity assumptions, which can be
required for instance to obtain the so-called Master loop equations (see Eynard
et al. ISDI ESI).

Proof of Theorem 5.2: It is enough to notice that, when diagonalizing the
matrices $A_i$'s, the interaction is expressed in terms of spherical integrals by
(3.1.1). Laplace's (or saddle point) method then gives the result (up to the
boundedness of the matrices $A_i$'s in the spherical integrals, which can be
obtained by approximation). We shall not detail it here and refer the reader to

□ 

We shall then study the variational problems for the above energies; indeed,
by standard large deviation considerations, it is clear that the spectral measures
of the matrices $(A_i)_{1\le i\le d}$ will concentrate on the set of the minimizers defining
the free energies, and in particular converge to these minimizers when they are
unique. We prove the following for the Ising model.

Theorem 5.4. Assume $P_1(x) \ge ax^4 + b$, $P_2(x) \ge ax^4 + b$ for some positive
constant $a$. Then

0) The infimum in $F_{Ising}$ is achieved at a unique couple $(\mu_A, \mu_B)$ of
probability measures.

1) $(\hat\mu^N_A, \hat\mu^N_B)$ converges almost surely toward $(\mu_A, \mu_B)$.

2) $(\mu_A, \mu_B)$ are compactly supported with finite non-commutative entropy

$$\Sigma(\mu) = \int\!\!\int \log|x - y|\, d\mu(x)\, d\mu(y).$$

3) There exists a couple $(\rho^{A\to B}, u^{A\to B})$ of measurable functions on $\mathbb R \times (0,1)$
such that $\rho_t^{A\to B}(x)dx$ is a probability measure on $\mathbb R$ for all $t \in (0,1)$ and
$(\mu_A, \mu_B, \rho^{A\to B}, u^{A\to B})$ are characterized uniquely as the minimizer of a strictly
convex function under a linear constraint.

In particular, $(\rho^{A\to B}, u^{A\to B})$ are solution of the Euler equation for isentropic
flow with negative pressure $p(\rho) = -\frac{\pi^2}{3}\rho^3$ such that, for all $(x,t)$ in the interior
of $\Omega = \{(x,t) \in \mathbb R \times [0,1];\ \rho_t^{A\to B}(x) \neq 0\}$,

$$\partial_t \rho_t^{A\to B} + \partial_x\big(\rho_t^{A\to B} u_t^{A\to B}\big) = 0$$

$$\partial_t\big(\rho_t^{A\to B} u_t^{A\to B}\big) + \partial_x\Big(\rho_t^{A\to B}(u_t^{A\to B})^2 - \frac{\pi^2}{3}(\rho_t^{A\to B})^3\Big) = 0 \qquad (5.2.4)$$

with the probability measure $\rho_t^{A\to B}(x)dx$ weakly converging toward $\mu_A(dx)$ (resp.
$\mu_B(dx)$) as $t$ goes to zero (resp. one). Moreover, we have

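$$P'(x) - x - \frac{\beta}{2}u_0^{A\to B}(x) - \frac{\beta}{2}H\mu_A(x) = 0 \qquad \mu_A\text{-a.s.}$$

$$\text{and}\quad Q'(x) - x + \frac{\beta}{2}u_1^{A\to B}(x) - \frac{\beta}{2}H\mu_B(x) = 0 \qquad \mu_B\text{-a.s.}$$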

For the other models, uniqueness of the minimizers is not always clear. For
instance, we obtain uniqueness of the minimizers for the $q$-Potts models only
for $q \le 2$ whereas it is also expected for $q = 3$. For the description of these
minimizers, I refer the reader to

To prove Theorem 5.4, we shall study more carefully the entropy



P 

Jp{p, Pd) = g inf-i^MD (f ), ^i = P} 



and in particular understand where the infimum is taken. The main observation
is that by (4.2.21), for any $f \in C_b^{2,1}(\mathbb R \times [0,1])$,

$$S^{0,1}(\nu, f)^2 \le 2\, S^{0,1}(\nu)\, \langle f, f\rangle^{0,1}_\nu$$

so that the linear form $f \to S^{0,1}(\nu, f)$ is bounded and Riesz's theorem asserts
that there exists $k \in H^1_\nu = \overline{C_b^{2,1}(\mathbb R \times [0,1])}^{\langle\cdot,\cdot\rangle^{0,1}_\nu}$ such that for any
$f \in C_b^{2,1}(\mathbb R \times [0,1])$,

$$S^{0,1}(\nu, f) = \langle f, k\rangle^{0,1}_\nu. \qquad (5.2.5)$$

We then say that $(\nu, k)$ satisfies (5.2.5). Then

$$S^{0,1}(\nu) = \frac12 \langle k, k\rangle^{0,1}_\nu = \frac12 \int_0^1\!\!\int (\partial_x k_t)^2\, d\nu_t(x)\, dt.$$

Property 5.5. Let $\mu_0 \in \{\mu \in \mathcal P(\mathbb R) : \Sigma(\mu) > -\infty\}$ and $\nu_. \in \{S_{\mu_0} < \infty\}$. If
$k$ is the field such that $(\nu, k)$ satisfies (5.2.5), we set $u_t := \partial_x k_t(x) + H\nu_t(x)$.
Then, $\nu_t(dx) \ll dx$ for almost all $t$ and


SM = \J J [{u t {x)f + {Hv t (x)f]dv t {x)dt- IfEfa) - S( M ^.6) 



(tit (a;)) dv t {x)dt + — 
6 



dv t (x Jv ., 



dx 



ydxdt 



--(Efa) + £(/«>)). 



Proof: We shall here only prove formula (5.2.6), assuming the first part of
the property, which is actually proved by showing that formula (5.2.6) holds
for smooth approximations $\nu^\epsilon$ of the measure-valued path $\nu$ such that $S_{\mu_D}(\nu^\epsilon)$
approximates $S_{\mu_D}(\nu)$, yielding a uniform control on the $L^3$ norm of their densities
in terms of $S_{\mu_D}(\nu)$. This uniform control allows us to show that any path $\nu_. \in
\{S_{\mu_D} < \infty\}$ is absolutely continuous with respect to Lebesgue measure, with
density in $L^3(dxdt)$.

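Let us denote by $k$ the field associated with $\nu$, i.e. such that for any $f \in
C_b^{2,1}(\mathbb R \times [0,1])$,

$$\int f(x,t)\,d\nu_t(x) - \int f(x,s)\,d\nu_s(x) = \int_s^t\!\!\int \partial_v f(x,v)\,d\nu_v(x)\,dv$$
$$\qquad + \frac12 \int_s^t\!\!\int\!\!\int \frac{\partial_x f(x,v) - \partial_x f(y,v)}{x-y}\,d\nu_v(x)\,d\nu_v(y)\,dv$$
$$\qquad + \int_s^t\!\!\int \partial_x f(x,v)\,\partial_x k(x,v)\,d\nu_v(x)\,dv \qquad (5.2.7)$$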



with $\partial_x k \in L^2(d\nu_t(x) \times dt)$. Observe that by [119], p. 170, for any $s \in [0,1]$ such
that $\nu_s$ is absolutely continuous with respect to Lebesgue measure with density
$\rho_s \in L^3(dx)$, for any compactly supported measurable function $\partial_x f(.,s)$,

$$\int\!\!\int \frac{\partial_x f(x,s) - \partial_x f(y,s)}{x-y}\,d\nu_s(x)\,d\nu_s(y) = 2\int \partial_x f(x,s)\,H\nu_s(x)\,\rho_s(x)\,dx.$$

Hence, since we assumed that $\nu_s(dx) = \rho_s(x)dx$ for almost all $s$ with $\rho \in
L^3(dxdt)$, (5.2.7) shows that for any $f \in C_b^{2,1}(\mathbb R \times [0,1])$

$$\int f(x,t)\rho_t(x)\,dx - \int f(x,s)\rho_s(x)\,dx = \int_s^t\!\!\int \partial_v f(x,v)\rho_v(x)\,dx\,dv \qquad (5.2.8)$$
$$\qquad + \int_s^t\!\!\int \partial_x f(x,v)\,u(x,v)\,\rho_v(x)\,dx\,dv,$$

i.e. that in the sense of distributions on $\mathbb R \times [0,1]$,

$$\partial_s\rho_s + \partial_x(u_s\rho_s) = 0. \qquad (5.2.9)$$

Moreover, since $H\nu$ belongs to $L^2(d\nu_s \times ds)$ when $\rho \in L^3(dxdt)$, we can write

$$2S_{\mu_0}(\nu_.) = \langle k, k\rangle^{0,1}_\nu = \int_0^1\!\!\int_{\mathbb R} (u_s(x))^2\,d\nu_s(x)\,ds + \int_0^1\!\!\int_{\mathbb R} (H\nu_s(x))^2\,d\nu_s(x)\,ds$$
$$\qquad - 2\int_0^1\!\!\int_{\mathbb R} H\nu_s(x)\,u_s(x)\,d\nu_s(x)\,ds. \qquad (5.2.10)$$

We shall now see that the last term in the above right hand side only depends
on $(\mu_0, \mu_1)$. To simplify, let us assume that $\nu$ is a smooth path such that $f_t(x) =
\int \log|x - y|\,d\nu_t(y)$ is in $C_b^{2,1}(\mathbb R \times [0,1])$. Then, (5.2.7) yields

$$\Sigma(\nu_1) - \Sigma(\nu_0) = 2\int_0^1\!\!\int_{\mathbb R} H(\nu_s)(x)\,u_s(x)\,d\nu_s(x)\,ds \qquad (5.2.11)$$

which gives the result. The general case is obtained by smoothing by free
convolution, as can be seen in [til], p. 537-538.

□ 
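The term in $(H\nu_s)^2$ appearing in (5.2.10) is usually rewritten through the identity $\int (H\nu(x))^2 d\nu(x) = \frac{\pi^2}{3}\int \rho(x)^3 dx$ for $\nu(dx) = \rho(x)dx$ with $\rho \in L^3$. This identity is a standard fact assumed here, not something derived in the text above. A minimal numerical sanity check, assuming the semicircular law on $[-2,2]$, for which $H\sigma(x) = x/2$ on the support, can be sketched as follows; all names are illustrative.

```python
import math


def rho(x):
    """Density of the semicircular law on [-2, 2] (variance 1)."""
    return math.sqrt(max(4.0 - x * x, 0.0)) / (2.0 * math.pi)


def integrate(f, lo=-2.0, hi=2.0, n=100_000):
    """Midpoint rule for the integral of f over [lo, hi]."""
    h = (hi - lo) / n
    return h * sum(f(lo + (i + 0.5) * h) for i in range(n))


# Left side: integral of (H sigma)^2 against sigma, using H sigma(x) = x / 2.
lhs = integrate(lambda x: (x / 2.0) ** 2 * rho(x))

# Right side: (pi^2 / 3) times the integral of the cubed density.
rhs = (math.pi ** 2 / 3.0) * integrate(lambda x: rho(x) ** 3)

if __name__ == "__main__":
    print(lhs, rhs)
```

Both sides equal $1/4$ in this example: the left side is a quarter of the variance of the semicircular law, and the right side follows from $\int_{-2}^{2}(4-x^2)^{3/2}dx = 6\pi$.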

To prove Theorem 5.4, one should therefore write
Firing = -iaf{C(p,v,p*,m*); 

(4(dx) = p* t {x)dx G C([Q,l},T>(R)),pl = m,Mi = v,dtp* t + d x m* t = 0} 

with 



(3 ( f 1 f K(x)) 2 



7T 



2 r 1 



It is easy to see that 

{li,v,p*,m*) G V{M) 2 x C([0,1],V(M)) x ^((p^x^dxdt) -»■ C{n, v,p*, m*) 
is strictly convex. Therefore, since $F_{Ising}$ is its infimum under a linear
constraint, this infimum is taken at a unique point $(\mu_A, \mu_B, \rho^{A\to B}, u^{A\to B})$. To
describe such a minimizer, one classically performs variations.

The only point to take care of here is that perturbations should be constructed
so that the constraint remains satisfied. This type of problem has been dealt
with before. In the case above, I met three possible strategies.

The first is to use a target type perturbation, which is a standard perturbation
on the space of probability measures, viewed as a subspace of the vector space
of measures. The second is to make a perturbation with respect to the source.
This strategy was followed by D. Serre in [108]. The idea is basically to set
$a_t(x) = a(t,x) = (\rho^*_t(x), \rho^*_t(x)u^*_t(x))$ so that the constraint reads $\mathrm{div}(a_t(x)) = 0$
and perturb $a$ by considering a family

$$a^g = J_g\,(a.\nabla_{x,t}h) \circ g = J_g\,\big(\rho^*(\partial_t h + u^*\partial_x h)\big) \circ g$$

with a $C^\infty$ diffeomorphism $g$ of $[0,1] \times \mathbb R$ with inverse $h = g^{-1}$ and Jacobian $J_g$.
Note here that $\mathrm{div}(a^g_t(x)) = 0$. Such an approach yields the Euler equation.

The last way is to use convex analysis, following for instance Y. Brenier (see
[26], Section 2). These two last strategies can only be applied when we know
a priori that $(\mu_A, \mu_B)$ are compactly supported (this indeed guarantees some a
priori bounds on $(\rho^*, u^*\rho^*)$ for instance). When the minimizers are smooth, all
these strategies should give the same result. It is therefore important to study
these regularity properties.

It is quite hard to obtain good smoothness properties of the minimizers
directly from the fact that they minimize C. An alternative possibility is to come
back to the matricial integrals and find directly there some of these properties
(cf. [H2D). What I did was to obtain directly the following information.

Property 5.6. 1) If $P(x) \ge a|x|^4 + b$, $Q(x) \ge a|x|^4 + b$ for some $a > 0$, there
exists a finite constant $C$ such that for any $N \in \mathbb N$, there exists $k(N)$
going to infinity with $N$, such that

$$\frac{1}{N}\mathrm{Tr}(A_N^{2p}) \le C^p, \qquad \frac{1}{N}\mathrm{Tr}(B_N^{2p}) \le C^p, \qquad p \le k(N)$$

for $\mu^N_{Ising}$-almost all $(A_N, B_N)$.

2) There exists a sequence $(A_N, B_N)$ of matrices such that $\hat\mu^N_{A_N}$ (resp. $\hat\mu^N_{B_N}$)
converges toward $\mu_A$ (resp. $\mu_B$) and a Wigner matrix $X_N$, independent from
$(A_N, B_N)$, such that $\hat\mu^N_{tB_N + (1-t)A_N + \sqrt{t(1-t)}X_N}$ converges toward $\mu^*_t(dx) = \rho^*_t(x)dx$.

The first result tells us that the polynomial potentials $P$, $Q$ force the limiting
laws $(\mu_A, \mu_B)$ to be compactly supported. The second point is also quite natural
when we think that we are looking at matrices with Gaussian entries with given
values at time zero and one; the Brownian bridge is the path which takes the
lowest energy to do that and therefore it is natural that the minimizer of $S_{\mu_D}$
should be the limiting distribution of a matrix-valued Brownian bridge. As we
shall see in Chapter 6, such a distribution has a limit in free probability as soon
as $\hat\mu^N_{tB_N + (1-t)A_N}$ converges for all $t \in [0,1]$; it is nothing but the distribution of
a free Brownian bridge $tB + (1-t)A + \sqrt{t(1-t)}\,S$ between $A$ and $B$, with a
semi-circular variable $S$, free with $(A, B)$. Note however that unless the joint law
of $(A, B)$ is given, this result does not describe entirely the law $\mu^*_t$. It however
shows, because the limiting law is a free convolution by a semicircular law, that
we have the following

Corollary 5.7. 1) $\mu_A$ and $\mu_B$ are compactly supported.

2) a) There exists a compact set $K \subset \mathbb R$ so that for all $t \in [0,1]$, $\mu^*_t(K^c) = 0$.
For all $t \in (0,1)$, the support of $\mu^*_t$ is the closure of its interior (in particular it
does not put mass on points).

b) $\mu^*_t(dx) \ll dx$ for all $t \in (0,1)$. Denote $\rho^*_t(x) = \frac{d\mu^*_t(x)}{dx}$.

c) There exists a finite constant $C$ (independent of $t$) so that, $\mu^*_t$ almost
surely,

p*{xf + (Hp* t {x)f K^l-t))- 1 

and 

\u*(x)\<C(t(l-t))-i. 

d) $(\rho^*, u^*)$ are analytic in the interior of $\Omega = \{(x,t) \in \mathbb R \times [0,1] : \rho^*_t(x) > 0\}$.

e) At the boundary of = {x G K : p\ (x) > 0}, for x G fit, 

\p^fd x pl(x)\< ^ t2 l { ^ t? => t {x) <( —^— 
if xq is the nearest point of x in f2£. 

All these properties are due to the free convolution by the semi-circular law,
and are direct consequences of ^HJ. Once Corollary 5.7 is given, the variational
study of $F_{Ising}$ is fairly standard and gives Theorem 5.4. We do not detail this
proof here. Hence, free probability arises naturally when we deal with the study
of the rate function $S_{\mu_D}$ and more generally when we consider traces of matrices
with size going to infinity. We describe the basis of this rapidly developing field
in the next chapter.

5.3. Discussion and open problems 

Matrix models have been much more studied in physics than in mathematics 
and raise numerous open questions. 

As in the last chapter, it would be extremely interesting to understand the
relation between the asymptotics of the free energy and the asymptotics of its
derivatives at the origin, which are the objects of primary interest. Once this
question is settled, one should understand how to retrieve information
on this series of numbers from the limiting free energy. In fact, it would be
tempting for instance to describe the radius of convergence of this series via
the phase transition of the model, since both should coincide with a default of
analyticity of the free energy/the generating function of this series. However, for
instance for the Ising model, one should realize that the criticality is reached
at negative temperature where the integral actually diverges. Even though the
free energy can still be defined in this domain, its description as a generating
function becomes even more unclear. There are however some challenging works
on this subject in physics (cf. [2] for instance).

There are many other matrix models to be understood with nice combinatorial
interpretations. A few were solved in the physics literature by means of character
expansions, for instance in the work of V. Kazakov, M. Staudacher, I. Kostov
or P. Zinn-Justin. However, such expansions are signed in general and the saddle
point methods used are rarely justified. In one case, M. Maida and myself
could give a mathematical understanding to this method. In general, even the
question of the existence of the free energies for most matrix models (that is the
convergence of $N^{-2}\log Z_N(P)$) is open and would actually be of great interest
in free probability (see the discussion at the end of Chapter EJ. 

Related to string theory is the question of the understanding of the full
expansion of the partition function in terms of the dimension $N$ of the matrices,
and the definition of critical exponents. This is still far from being understood
on a rigorous ground, even though the present section showed that at least for
AB interaction models, a good strategy could be to understand better
non-intersecting Brownian paths. However, it is not yet clear how to concatenate
such an approach with the technology currently used for one matrix models
given in 0^1, which is based on orthogonal polynomials and Riemann-Hilbert
techniques. It would be very tempting to try to generalize precise Laplace's
methods which are commonly used to understand the second order corrections
of the free energy of mean field interacting particle systems |2Dj. However, such
an approach has until now failed even in the one matrix case due to the singularity of
the logarithmic interacting potential (cf. [US]). Another approach to this problem
has recently been proposed by B. Eynard et al. [SniEH| and M. Bertola fTl).

From a more analytic point of view, it would be interesting to understand
better the properties of complex Burgers equations; we have here deduced most
of the smoothness properties of the solution by recalling its realization in free
probability terms. A direct analysis should be doable. Moreover, it would be
nice to understand how holes in the initial density propagate along time; this
might as well be related to the phase transition phenomena according to A.
Matytsin and P. Zaugg



Chapter 6 



Large random matrices and 
free probability 

Free probability is a probability theory for non-commutative variables. In this 
field, random variables are operators which we shall assume hereafter selfad- 
joint. For the sake of completeness, but actually not needed for our purpose, we 
shall recall some notions of operator algebra. We shall then describe free prob- 
ability as a probability theory on non-commutative functionals, a point of view 
which forgets the space of realizations of the laws. Then, we will see that free 
probability is the right framework to consider large random matrices. Finally, 
we will sketch the proofs of some results we needed in the previous chapter. 

6.1. A few notions about von Neumann algebras 

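Definition 6.1 (Definition 37, [16]). A $C^*$-algebra $(\mathcal A, *)$ is an algebra
equipped with an involution $*$ and a norm $\|.\|_{\mathcal A}$ which furnishes it with a Banach
space structure and such that for any $X, Y \in \mathcal A$,

$$\|XY\|_{\mathcal A} \le \|X\|_{\mathcal A}\|Y\|_{\mathcal A}, \qquad \|X^*\|_{\mathcal A} = \|X\|_{\mathcal A}, \qquad \|XX^*\|_{\mathcal A} = \|X\|_{\mathcal A}^2.$$

$X \in \mathcal A$ is self-adjoint iff $X^* = X$. $\mathcal A_{sa}$ denotes the set of self-adjoint elements of
$\mathcal A$. A $C^*$-algebra $(\mathcal A, *)$ is said to be unital if it contains a neutral element denoted $I$.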

$\mathcal A$ can always be realized as a sub-$C^*$-algebra of the space $B(H)$ of bounded
linear operators on a Hilbert space $H$. For instance, if $\mathcal A$ is a unital $C^*$-algebra
furnished with a positive linear form $\tau$, one can always construct such a Hilbert
space $H$ by completing and separating $L^2(\tau)$ (this is the Gelfand-Naimark-Segal
(GNS) construction, see [TTS1, Theorem 2.2.1).

We shall restrict ourselves to this case in the sequel and denote by $H$ a Hilbert
space equipped with a scalar product $\langle ., .\rangle_H$ such that $\mathcal A \subset B(H)$.

Definition 6.2. If $\mathcal A$ is a sub-$C^*$-algebra of $B(H)$, $\mathcal A$ is a von Neumann
algebra iff it is closed for the weak topology, generated by the semi-norms family
$\{p_{\xi,\eta}(X) = \langle X\xi, \eta\rangle_H,\ \xi, \eta \in H\}$.

Let us notice that by definition, a von Neumann algebra contains only bounded
operators. The theory nevertheless allows us to consider unbounded operators
thanks to the notion of affiliated operators. An operator $X$ on $H$ is said to be
affiliated to $\mathcal A$ iff for any Borel function $f$ on the spectrum of $X$, $f(X) \in \mathcal A$ (see
[104], p. 164). Here, $f(X)$ is well defined for any operator $X$ as the operator
with the same eigenvectors as $X$ and eigenvalues given by the image of those
of $X$ by the map $f$. Note also that if $X$ and $Y$ are affiliated with $\mathcal A$, $aX + bY$
is also affiliated with $\mathcal A$ for any $a, b \in \mathbb R$.

A state $\tau$ on a unital von Neumann algebra $(\mathcal A, *)$ is a linear form on $\mathcal A$ such
that $\tau(\mathcal A_{sa}) \subset \mathbb R$ and

1. Positivity: $\tau(AA^*) \ge 0$ for any $A \in \mathcal A$.

2. Total mass: $\tau(I) = 1$.

A tracial state satisfies the additional hypothesis

3. Traciality: $\tau(AB) = \tau(BA)$ for any $A, B \in \mathcal A$.

The couple $(\mathcal A, \tau)$ of a von Neumann algebra equipped with a state $\tau$ is called
a $W^*$-probability space.

Example 6.3. 1. Let $n \in \mathbb N$, and consider $A = M_n(\mathbb C)$ as the set of bounded
linear operators on $\mathbb C^n$. For any $v \in \mathbb C^n$ with $\|v\|_{\mathbb C^n} = 1$,

$$\tau_v(M) = \langle v, Mv\rangle_{\mathbb C^n}$$

is a state. There is a unique tracial state on $M_n(\mathbb C)$, which is the
normalized trace

$$\mathrm{tr}(M) = \frac1n\sum_{i=1}^n M_{ii}.$$

2. Let $(X, \Sigma, d\mu)$ be a classical probability space. Then $A = L^\infty(X, \Sigma, d\mu)$
equipped with the expectation $\tau(f) = \int f\,d\mu$ is a (non-)commutative
probability space. Here, $L^\infty(X, \Sigma, d\mu)$ is identified with the set of bounded
linear operators on the Hilbert space $H$ obtained by separating $L^2(X, \Sigma, d\mu)$
(by the equivalence relation $f \sim g$ iff $\mu((f-g)^2) = 0$). The identification
follows from the multiplication operator $M(f)g = fg$. Observe
that it is weakly closed for the semi-norms $(\langle f, \cdot\,g\rangle_H,\ f, g \in L^2(\mu))$ as
$L^\infty(X, \Sigma, d\mu)$ is the dual of $L^1(X, \Sigma, d\mu)$.

3. Let $G$ be a discrete group, and $(e_h)_{h\in G}$ be a basis of $\ell^2(G)$. Let $\lambda(h)e_g = e_{hg}$.
Then, we take A to be the von Neumann algebra generated by the linear
span of $\lambda(G)$. The (tracial) state is the linear form such that
$\tau(\lambda(g)) = 1_{g=e}$ ($e$ = neutral element).

We refer to [128] for further examples and details.
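To make Example 6.3.1 concrete, here is a minimal pure-Python sketch that compares a vector state with the normalized trace on $M_2(\mathbb C)$. The helper names (`matmul`, `normalized_trace`, `vector_state`) and the matrices `A`, `B` are illustrative choices, not objects from the text; the scalar product is taken antilinear in its first argument, which is an assumption about the convention.

```python
# Sketch: states on M_n(C), with matrices stored as nested lists.

def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def normalized_trace(m):
    """tr(M) = (1/n) * sum_i M_ii, the tracial state on M_n(C)."""
    n = len(m)
    return sum(m[i][i] for i in range(n)) / n

def vector_state(v, m):
    """tau_v(M) = <v, M v>, antilinear in the first slot (assumed convention)."""
    n = len(v)
    mv = [sum(m[i][j] * v[j] for j in range(n)) for i in range(n)]
    return sum(v[i].conjugate() * mv[i] for i in range(n))

A = [[1, 2j], [0, 3]]
B = [[0, 1], [1j, 1]]
v = [1, 0]  # unit vector of C^2

# The normalized trace is tracial: tr(AB) = tr(BA).
print(normalized_trace(matmul(A, B)), normalized_trace(matmul(B, A)))
# The vector state has total mass one but is not tracial in general.
print(vector_state(v, matmul(A, B)), vector_state(v, matmul(B, A)))
```

On this pair the two normalized traces agree while the two values of the vector state differ, which illustrates why traciality is an additional hypothesis.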

The notion of law $\tau_{X_1,\dots,X_m}$ of $m$ operators $(X_1,\dots,X_m)$ in a W*-probability
space $(A, \tau)$ is simply given by the restriction of the trace $\tau$ to the algebra
generated by $(X_1,\dots,X_m)$, that is by the values

$$\tau_{X_1,\dots,X_m}(P) = \tau(P(X_1,\dots,X_m)), \qquad \forall P \in \mathbb C\langle X_1,\dots,X_m\rangle$$

where $\mathbb C\langle X_1,\dots,X_m\rangle$ is the set of polynomial functions of $m$ non-commutative
variables.

6.2. Space of laws of m non-commutative self-adjoint variables 

Following the above description, laws of $m$ non-commutative self-adjoint
variables can be seen as elements of the set $\mathcal M^{(m)}$ of linear forms $\tau$ on the set of
polynomial functions of $m$ non-commutative variables $\mathbb C\langle X_1,\dots,X_m\rangle$ furnished
with the involution

$$(X_{i_1}X_{i_2}\cdots X_{i_n})^* = X_{i_n}X_{i_{n-1}}\cdots X_{i_1}$$

and such that

1. Positivity: $\tau(PP^*) \ge 0$ for any $P \in \mathbb C\langle X_1,\dots,X_m\rangle$.

2. Traciality: $\tau(PQ) = \tau(QP)$ for any $P, Q \in \mathbb C\langle X_1,\dots,X_m\rangle$.

3. Total mass: $\tau(I) = 1$.

This point of view is identical to the previous one. Indeed, by the
Gelfand-Naimark-Segal construction, being given $\mu \in \mathcal M^{(m)}$, we can construct a
W*-probability space $(A, \tau)$ and operators $(X_1,\dots,X_m)$ such that

$$\mu = \tau_{X_1,\dots,X_m}. \qquad (6.2.1)$$

This construction can be summarized as follows. Consider the bilinear form on
$\mathbb C\langle X_1,\dots,X_m\rangle^2$ given by

$$\langle P, Q\rangle_\tau = \tau(PQ^*).$$

We then let $H$ be the Hilbert space obtained as follows. We set
$L^2(\tau) = \overline{\mathbb C\langle X_1,\dots,X_m\rangle}^{\|\cdot\|_\tau}$ to be the completion of $\mathbb C\langle X_1,\dots,X_m\rangle$ for the norm
$\|\cdot\|_\tau = \langle\cdot,\cdot\rangle_\tau^{1/2}$.
We then separate $L^2(\tau)$ by taking the quotient by the left ideal

$$L_\tau = \{F \in L^2(\tau) : \|F\|_\tau = 0\}.$$

Then $H = L^2(\tau)/L_\tau$ is a Hilbert space with scalar product $\langle\cdot,\cdot\rangle_\tau$. The
non-commutative polynomials $\mathbb C\langle X_1,\dots,X_m\rangle$ act by left multiplication on $L^2(\tau)$ and
we can consider the completion of these multiplication operators for the
semi-norms $\{\langle P, \cdot\,Q\rangle_H,\ P, Q \in L^2(\tau)\}$, which form a von Neumann algebra
A equipped with a tracial state $\tau$ satisfying (6.2.1). In this sense, we can think
about A as the set of bounded measurable functions $L^\infty(\tau)$.

The topology under consideration is usually in free probability the
$\mathbb C\langle X_1,\dots,X_m\rangle^*$-topology, that is, $(\tau_{X_1^n,\dots,X_m^n},\ n \in \mathbb N)$ converges toward
$\tau_{X_1,\dots,X_m}$ iff for every $P \in \mathbb C\langle X_1,\dots,X_m\rangle$,

$$\lim_{n\to\infty}\tau_{X_1^n,\dots,X_m^n}(P) = \tau_{X_1,\dots,X_m}(P).$$

If $(X_1^n,\dots,X_m^n)_{n\in\mathbb N}$ are non-commutative variables whose law $\tau_{X_1^n,\dots,X_m^n}$
converges toward $\tau_{X_1,\dots,X_m}$, then we shall also say that $(X_1^n,\dots,X_m^n)_{n\in\mathbb N}$ converges
in law or in distribution toward $(X_1,\dots,X_m)$.

Such a topology is reasonable when one deals with uniformly bounded
non-commutative variables. In fact, if we consider for $R \in \mathbb R^+$,

$$\mathcal M_R^{(m)} := \{\mu \in \mathcal M^{(m)} : \mu(X_i^{2p}) \le R^{2p},\ \forall p \in \mathbb N,\ 1 \le i \le m\}$$

then it is not hard to see that $\mathcal M_R^{(m)}$, equipped with this weak-* topology, is a
Polish space (i.e. a separable complete metric space). A distance can for instance
be given by

$$d(\mu,\nu) = \sum_{n\ge 0}\frac{1}{2^n}\,|\mu(P_n) - \nu(P_n)|$$

where $P_n$ is a dense sequence of polynomials with operator norm bounded by
one when evaluated at any set of self-adjoint operators with operator norms
bounded by $R$.

This notion is the generalization of laws of $m$ real-valued variables bounded,
say, by a given finite constant $R$, in which case the weak-* topology driven by
polynomial functions is the same as the standard weak topology. Actually, it is
not hard to check that $\mathcal M_R^{(1)} = \mathcal P([-R, R])$. However, it can be useful to consider
more general topologies compatible with the existence of unbounded operators,
as might be encountered for instance when considering the deviations of large
random matrices. Then, the only point is to change the set of test functions. In
[29], we considered for instance the complex vector space $\mathcal{CC}^m_{st}(\mathbb C)$ generated by
the Stieltjes functionals

$$ST^m(\mathbb C) = \Big\{\prod^{\rightarrow}_{1\le i\le n}\Big(z_i - \sum_{k=1}^m \alpha_i^k X_k\Big)^{-1};\ z_i \in \mathbb C\setminus\mathbb R,\ \alpha_i^k \in \mathbb Q,\ n \in \mathbb N\Big\}. \qquad (6.2.2)$$

It can be checked easily that, with such type of test functions, $\mathcal M^{(m)}$ is again a
Polish space.

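Example 6.4. Let $N \in \mathbb N$ and consider $m$ Hermitian matrices $(A_1^N,\dots,A_m^N) \in \mathcal H_N^m$
with spectral radius $\|A_i^N\|_\infty \le R$, $1 \le i \le m$. Then, set

$$\hat\mu_{A_1^N,\dots,A_m^N}(P) = \mathrm{tr}\big(P(A_1^N,\dots,A_m^N)\big), \qquad \forall P \in \mathbb C\langle X_1,\dots,X_m\rangle.$$

Clearly, $\hat\mu_{A_1^N,\dots,A_m^N} \in \mathcal M_R^{(m)}$. Moreover, if $(A_1^N,\dots,A_m^N)_{N\in\mathbb N}$ is a sequence such
that

$$\lim_{N\to\infty}\hat\mu_{A_1^N,\dots,A_m^N}(P) = \tau(P), \qquad \forall P \in \mathbb C\langle X_1,\dots,X_m\rangle,$$

then $\tau \in \mathcal M_R^{(m)}$ since $\mathcal M_R^{(m)}$ is complete.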

It is actually a long standing question posed by A. Connes to know whether
all $\tau \in \mathcal M_R^{(m)}$ can be approximated in such a way.

In the case $m = 1$, the question amounts to asking whether for all $\mu \in \mathcal P([-R, R])$,
there exists a sequence $(\lambda_1^N,\dots,\lambda_N^N)_{N\in\mathbb N}$ such that

$$\lim_{N\to\infty}\frac1N\sum_{i=1}^N \delta_{\lambda_i^N} = \mu.$$

This is well known to be true by Birkhoff's theorem (which is based on
Krein-Milman's theorem), but still an open question when $m \ge 2$.



6.3. Freeness



Free probability is not only a theory of probability for non-commutative vari- 
ables; it contains also the central notion of freeness, which is the analogue of 
independence in standard probability. 

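Definition 6.5. The variables $(X_1,\dots,X_m)$ and $(Y_1,\dots,Y_n)$ are said to be
free iff for any $p \in \mathbb N$ and any $(P_i, Q_i)_{1\le i\le p} \in (\mathbb C\langle X_1,\dots,X_m\rangle \times \mathbb C\langle X_1,\dots,X_n\rangle)^p$,

$$\tau\Big(\prod^{\rightarrow}_{1\le i\le p} P_i(X_1,\dots,X_m)\,Q_i(Y_1,\dots,Y_n)\Big) = 0$$

as soon as

$$\tau(P_i(X_1,\dots,X_m)) = 0, \quad \tau(Q_i(Y_1,\dots,Y_n)) = 0, \quad \forall i \in \{1,\dots,p\}.$$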
Remark 6.6:

1) The notion of freeness defines uniquely the law of $\{X_1,\dots,X_m,Y_1,\dots,Y_n\}$
once the laws of $(X_1,\dots,X_m)$ and $(Y_1,\dots,Y_n)$ are given (in fact, check that
every expectation of any polynomial is given uniquely by induction over the
degree of this polynomial).

2) If $X$ and $Y$ are free variables with joint law $\tau$, and $P, Q \in \mathbb C\langle X\rangle$ are such
that $\tau(P(X)) = 0$ and $\tau(Q(Y)) = 0$, it is clear that $\tau(P(X)Q(Y)) = 0$ as it
should for independent variables, but also $\tau(P(X)Q(Y)P(X)Q(Y)) = 0$, which
is very different from what happens with usual independent commutative
variables where $\mu(P(X)Q(Y)P(X)Q(Y)) = \mu(P(X)^2Q(Y)^2) > 0$.

3) The above notion of freeness is related with the usual notion of freeness
in groups as follows. Let $(x_1,\dots,x_m,y_1,\dots,y_n)$ be elements of a group. Then,
$(x_1,\dots,x_m)$ is said to be free from $(y_1,\dots,y_n)$ if any non-trivial word in
these elements is not the neutral element of the group, that is, for all
monomials $P_1,\dots,P_k \in \mathbb C\langle X_1,\dots,X_m\rangle$ and $Q_1,\dots,Q_k \in \mathbb C\langle X_1,\dots,X_n\rangle$,
$P_1(x)Q_1(y)P_2(x)\cdots Q_k(y)$ is not the neutral element as soon as the $Q_i(y)$ and
the $P_i(x)$ are not the neutral element. If we consider, following Example 6.3 3),
the map which is one on trivial words and zero otherwise and extend it by
linearity to polynomials, we see that this defines a tracial state on the operators
of left multiplication by the elements of the group and that the two notions of
freeness coincide.

4) We shall see below that examples of free variables naturally show up when
considering random matrices with size going to infinity.
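Points 2) and 3) of the remark can be checked by hand in the group algebra of the free group on two generators. The sketch below is only an illustration under stated assumptions: group elements are encoded as reduced words (tuples of `(generator, exponent)` letters), elements of the group algebra as dictionaries from words to coefficients, and the names `reduce_word`, `multiply`, `product`, `tau`, `P`, `Q` are hypothetical. The state `tau` is the one of Example 6.3 3): the coefficient of the empty word.

```python
from collections import defaultdict
from functools import reduce

def reduce_word(word):
    """Cancel adjacent pairs g g^{-1}; a word is a tuple of (generator, +1 or -1)."""
    out = []
    for gen, exp in word:
        if out and out[-1] == (gen, -exp):
            out.pop()
        else:
            out.append((gen, exp))
    return tuple(out)

def multiply(p, q):
    """Product in the group algebra (dict: reduced word -> coefficient)."""
    result = defaultdict(int)
    for w1, c1 in p.items():
        for w2, c2 in q.items():
            result[reduce_word(w1 + w2)] += c1 * c2
    return dict(result)

def product(*factors):
    return reduce(multiply, factors)

def tau(p):
    """Coefficient of the neutral element (the empty word)."""
    return p.get((), 0)

# Two centered self-adjoint elements: P = a + a^{-1}, Q = b + b^{-1}.
P = {(("a", 1),): 1, (("a", -1),): 1}
Q = {(("b", 1),): 1, (("b", -1),): 1}

print(tau(P), tau(Q))            # both centered
print(tau(product(P, Q, P, Q)))  # alternating product of centered elements
print(tau(product(P, P, Q, Q)))  # equals tau(P^2) * tau(Q^2)
```

The alternating product has vanishing trace because no word of the form $a^{\pm1}b^{\pm1}a^{\pm1}b^{\pm1}$ reduces to the neutral element, whereas $\tau(P^2Q^2) = \tau(P^2)\tau(Q^2) = 4$; for commuting independent variables the two quantities would coincide.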
6.4. Large random matrices and free probability

We have already seen in Example 6.3 that if we consider $(M_1^N,\dots,M_m^N) \in \mathcal H_N^m$,
their empirical distribution $\hat\mu^N$, given for any $P \in \mathbb C\langle X_1,\dots,X_m\rangle$ by

$$\hat\mu_{M_1^N,\dots,M_m^N}(P) := \mathrm{tr}\big(P(M_1^N,\dots,M_m^N)\big),$$

belongs to $\mathcal M^{(m)}$.

Moreover, if we take a sequence $\{(M_1^N,\dots,M_m^N) \in \mathcal H_N^m\}_{N\in\mathbb N}$ such that for
any $P \in \mathbb C\langle X_1,\dots,X_m\rangle$ the limit

$$\tau(P) := \lim_{N\to\infty}\mathrm{tr}\big(P(M_1^N,\dots,M_m^N)\big)$$

exists, then $\tau \in \mathcal M^{(m)}$. Hence, $\mathcal M^{(m)}$ is the natural space in which one should
consider large matrices, as far as their trace is concerned. As we already stressed
in Example 6.4, it is still unknown whether any element of $\mathcal M^{(m)}$ can be
approximated by a sequence of empirical distributions of self-adjoint matrices in the
case $m \ge 2$. Reciprocally, large random matrices became an important source
of examples of operator algebras.

The fact that free probability is particularly well suited to study large
random matrices is due to an observation of Voiculescu [121] who proved that if
$(A_N, B_N)_{N\in\mathbb N}$ is a sequence of uniformly bounded diagonal matrices with
converging spectral distribution, and $U_N$ a unitary matrix following Haar measure
$m_2^N$, then the empirical distribution of $(A_N, U_NB_NU_N^*)$ converges toward the
law of $(A, B)$, $A$ and $B$ being free and each of their laws being given by their
limiting spectral distribution. This convergence holds in expectation with respect
to the unitary matrices $U_N$ and then almost surely (as can be checked by
Borel-Cantelli's lemma and by controlling

$$\int\Big(\hat\mu_{A_N,U_NB_NU_N^*}(P) - \int\hat\mu_{A_N,U_NB_NU_N^*}(P)\,dm_2^N(U_N)\Big)^2 dm_2^N(U_N)$$

as $N$ goes to infinity).

As a consequence, if one considers the joint distribution of $m$ independent
Wigner matrices $(X_1^N,\dots,X_m^N)$, their empirical distribution converges almost
surely toward the law of $(S_1,\dots,S_m)$, $m$ free variables distributed according to the
semi-circular law $\sigma(dx) = C\sqrt{4 - x^2}\,dx$.
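As a quick sanity check of the semi-circular law, one can compute its moments numerically. The sketch below assumes the normalizing constant is $C = 1/(2\pi)$ (the value that makes $\sigma$ a probability measure on $[-2,2]$) and compares the even moments with the Catalan numbers, a standard fact that is not stated in the text above. The function names and the midpoint rule are illustrative choices.

```python
import math

def semicircle_moment(k, n=20000):
    """Midpoint-rule approximation of the k-th moment of
    sigma(dx) = (1 / (2*pi)) * sqrt(4 - x^2) dx on [-2, 2]."""
    h = 4.0 / n
    total = 0.0
    for j in range(n):
        x = -2.0 + (j + 0.5) * h
        total += x ** k * math.sqrt(4.0 - x * x)
    return total * h / (2.0 * math.pi)

def catalan(k):
    """k-th Catalan number, binom(2k, k) / (k + 1)."""
    return math.comb(2 * k, k) // (k + 1)

for k in range(4):
    # Even moments against Catalan numbers; odd moments vanish by symmetry.
    print(2 * k, round(semicircle_moment(2 * k), 4), catalan(k))
```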

Hence, freeness appears very naturally in the context of large random
matrices. The semi-circular law $\sigma$, which we have seen to be the asymptotic spectral
distribution of Gaussian Wigner matrices, is in fact also deeply related with the
notion of freeness; it plays the role that the Gaussian law has with the notion of
independence in the sense that it gives the limit law in the analogue of the central
limit theorem. Indeed, let $(A, \tau)$ be a W*-probability space and $\{X_i, i \in \mathbb N\} \subset A$
be free random variables which are centered ($\tau(X_i) = 0$) and with covariance
1 ($\tau(X_i^2) = 1$). Then, the sum $\frac{1}{\sqrt n}(X_1 + \cdots + X_n)$ converges in distribution
toward a semi-circular variable. We shall be interested in the next section in the
free Ito's calculus which appeared naturally in our problems.

Before that, let us introduce the notation for free convolution: if $X$ (resp.
$Y$) is a random variable with law $\mu$ (resp. $\nu$) and $X$ and $Y$ are free, we denote by
$\mu \boxplus \nu$ the law of $X + Y$. There is a general formula to describe the law $\mu \boxplus \nu$;
in fact, analytic functions $R_\mu$ were introduced by Voiculescu as an analogue of
the logarithm of the Fourier transform in the sense that

$$R_{\mu\boxplus\nu}(z) = R_\mu(z) + R_\nu(z)$$

and that $R_\mu$ defines $\mu$ uniquely. I refer the reader to [128] for more details.
Convolution by a semicircular variable was precisely studied by Biane [15].
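For compactly supported laws, the additivity of the R-transform can be used at the level of moments. The sketch below is an illustration under an assumption not spelled out in the text: it uses the standard combinatorial description of the coefficients of $R_\mu$ (the free cumulants $\kappa_n$) through the recursion $m_n = \sum_{s=1}^n \kappa_s\,[z^{n-s}]\,M(z)^s$ with $M(z) = \sum_i m_i z^i$ and $m_0 = 1$. All function names are hypothetical.

```python
def power_coeffs(m, s, deg):
    """Coefficients of z^0..z^deg in (sum_i m[i] z^i)^s."""
    res = [1] + [0] * deg
    for _ in range(s):
        new = [0] * (deg + 1)
        for i, a in enumerate(res):
            if a:
                for j in range(deg + 1 - i):
                    new[i + j] += a * m[j]
        res = new
    return res

def moments_to_free_cumulants(m):
    """m[0] must be 1; returns kappa with kappa[0] = 0."""
    n = len(m) - 1
    kappa = [0] * (n + 1)
    for k in range(1, n + 1):
        lower = sum(kappa[s] * power_coeffs(m, s, k - s)[k - s]
                    for s in range(1, k))
        kappa[k] = m[k] - lower  # the s = k term is kappa_k * m_0^k
    return kappa

def free_cumulants_to_moments(kappa):
    n = len(kappa) - 1
    m = [1] + [0] * n
    for k in range(1, n + 1):
        m[k] = sum(kappa[s] * power_coeffs(m, s, k - s)[k - s]
                   for s in range(1, k + 1))
    return m

def free_convolution_moments(m_mu, m_nu):
    """Moments of the free convolution: free cumulants add up."""
    k_mu = moments_to_free_cumulants(m_mu)
    k_nu = moments_to_free_cumulants(m_nu)
    return free_cumulants_to_moments([a + b for a, b in zip(k_mu, k_nu)])

semicircle = [1, 0, 1, 0, 2, 0, 5]   # moments of the standard semi-circular law
bernoulli = [1, 0, 1, 0, 1, 0, 1]    # moments of (delta_{-1} + delta_{1}) / 2

print(moments_to_free_cumulants(semicircle))
print(free_convolution_moments(semicircle, semicircle))
print(free_convolution_moments(bernoulli, bernoulli))
```

The semi-circular law has a single non-zero free cumulant ($\kappa_2 = 1$), so its free convolution with itself is again semi-circular with covariance 2; the free convolution of two symmetric Bernoulli laws has moments 2, 6, 20, those of the arcsine law on $[-2,2]$.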

6.5. Free processes 

The notion of freeness allows us to construct a free Brownian motion as follows.

Definition 6.7. A free Brownian motion $\{S_t, t \ge 0\}$ is a process such that

1) $S_0 = 0$.

2) For any $t \ge s \ge 0$, $S_t - S_s$ is free with the algebra $\sigma(S_u, u \le s)$ generated
by $(S_u, u \le s)$.

3) For any $t \ge s \ge 0$, the law of $S_t - S_s$ is a semi-circular distribution with
covariance $t - s$: $\sigma_{t-s}(dx) = (2\pi(t-s))^{-1}\sqrt{4(t-s) - x^2}\,dx$.

The Hermitian Brownian motion in particular converges toward the free
Brownian motion according to the previous section (using convergence on
cylinder functions).

As for the Brownian motion, one can make sense of the free differential
equation

$$dX_t = dS_t + b_t(X_t)\,dt$$

and show existence and uniqueness of the solution to this equation when $b_t$ is a
Lipschitz operator in the sense that, if $\|\cdot\|$ denotes the operator norm on the algebra
A ($\|A\| = \lim_{n\to\infty}\tau(A^{2n})^{\frac{1}{2n}}$) on which the free Brownian motion $\{S_t, t \ge 0\}$ lives,

$$\|b_t(X) - b_t(Y)\| \le C\,\|X - Y\|$$

with a finite constant $C$ (one just uses a Picard argument).

With the intuition given by stochastic calculus, we shall give some outline of 
the proof of some results needed in Chapter 

6.6. Continuity of the rate function under free convolution 

In the classical case, the entropy $S$ of the deviations of the law of the
empirical measure of independent Brownian motions decreases by convolution (see
(4.2.20)). We want here to generalize this result to our eigenvalues setting. The
intuition coming from the classical case, adapted to the free probability setting,
will help to show the following result:

Lemma 6.8. For any $p \in \mathcal P(\mathbb R)$, any $\nu \in \mathcal C([0,1], \mathcal P(\mathbb R))$,

$$S^{0,1}(\nu \boxplus p) \le S^{0,1}(\nu).$$

Proof: We shall give the philosophy of the proof via the formula (see (5.2.5)).
Namely, let us assume that $\nu_t$ can be represented as the law at time $t$ of the free
stochastic differential equation (FSDE)

$$dX_t = dS_t + \partial_x k_t(X_t)\,dt.$$

It can be checked by free Ito's calculus that this FSDE satisfies the same free
Fokker-Planck equation (5.2.5) (get the intuition by replacing $S$ by the
Hermitian Brownian motion and $X$ by a matrix-valued process). However, until
uniqueness of the solutions of (5.2.5) is proved, it is not clear that $\nu$ is indeed
the law of this FSDE.

Now, let $C$ be a random variable with law $p$, free with $S$ and $X_0$. $Y = X + C$
satisfies the same FSDE

$$dY_t = dS_t + \partial_x k_t(X_t)\,dt$$

and therefore its law $\mu_t = \nu_t \boxplus p$ satisfies, for any $f \in \mathcal C_b^{2,1}(\mathbb R\times[0,1])$,

$$S^{0,1}(\mu, f) = \int_0^1 \tau\big(\partial_x f_t(X_t + C)\,\tau(\partial_x k_t(X_t)\,|\,X_t + C)\big)\,dt$$

where $\tau(\,\cdot\,|X_t + C)$ is the orthogonal projection in $L^2(\tau)$ on the algebra generated
by $X_t + C$ (recall the definition of $L^2(\tau)$ given in Section 6.2). From this, we see
that

$$S^{0,1}(\mu) = \sup_{f\in\mathcal C_b^{2,1}(\mathbb R\times[0,1])}\Big\{S^{0,1}(\mu, f) - \frac12\langle f, f\rangle_\mu^{0,1}\Big\}$$

$$= \sup_{f\in\mathcal C_b^{2,1}(\mathbb R\times[0,1])}\Big\{\int_0^1 \tau\big(\partial_x f_t(X_t + C)\,\tau(\partial_x k_t(X_t)\,|\,X_t + C)\big)\,dt - \frac12\int_0^1\tau\big[(\partial_x f_t(X_t + C))^2\big]\,dt\Big\}$$

$$= \frac12\int_0^1 \tau\big(\tau(\partial_x k_t(X_t)\,|\,X_t + C)^2\big)\,dt$$

$$\le \frac12\int_0^1 \tau\big(\partial_x k_t(X_t)^2\big)\,dt = S^{0,1}(\nu).$$

Of course, such an inequality has nothing to do with the existence and
uniqueness of a strong solution of our free Fokker-Planck equation and we can indeed
prove this result by means of R-transform theory for any $\nu \in \{S^{0,1} < \infty\}$ (see
EDI).

□

6.7. The infimum of the rate function is achieved at a free Brownian bridge

Let us state more precisely the theorem obtained in this section. A free Brownian
bridge between $\mu_0$ and $\mu_1$ is the law of

$$X_t = (1 - t)X_0 + tX_1 + \sqrt{t(1-t)}\,S \qquad (6.7.3)$$

with a semicircular variable $S$, free with $X_0$ and $X_1$, with law $\mu_0$ and $\mu_1$
respectively. We let $\mathrm{FBB}(\mu_0,\mu_1) \subset \mathcal C([0,1], \mathcal P(\mathbb R))$ denote the set of such laws (which
depend of course not only on $\mu_0, \mu_1$ but on the joint distribution of $(X_0, X_1)$
too). Then, we shall prove (cf. Appendix 4 in |HJ) that

Theorem 6.9. Assume $\mu_0, \mu_1$ compactly supported, with support included into
$[-R, R]$.

1) Then,

$$J_\beta(\mu_0,\mu_1) = \frac\beta2\inf\{S(\nu);\ \nu_0 = \mu_0,\ \nu_1 = \mu_1\} = \frac\beta2\inf\{S(\nu);\ \nu \in \mathrm{FBB}(\mu_0,\mu_1)\}.$$

2) Since $\mathrm{FBB}(\mu_0,\mu_1)$ is a closed subset of $\mathcal C([0,1], \mathcal P(\mathbb R))$, the unique
minimizer in $J_\beta(\mu_0,\mu_1)$ belongs to $\mathrm{FBB}(\mu_0,\mu_1)$.

The proof of Theorem 6.9 is rather technical and goes back through the large
random matrices origin of $J_\beta$. Let us consider the case $\beta = 2$. By definition, if

$$X_t^N = X_0^N + H_t^N$$

with a real diagonal matrix $X_0^N$ with spectral measure $\hat\mu_0^N$ and a Hermitian
Brownian motion $H^N$, if we denote by $\hat\mu_t^N$ the spectral measure of $X_t^N$, then, if
$\hat\mu_0^N$ converges toward a compactly supported probability measure $\mu_0$, for any $\mu_1$,

$$\limsup_{\delta\to0}\limsup_{N\to\infty}\frac{1}{N^2}\log P\big(d(\hat\mu_1^N,\mu_1) < \delta\big) \le -\inf\{S(\nu);\ \nu_0 = \mu_0,\ \nu_1 = \mu_1\}.$$

Let us now reconsider the above limit and show that the limit must be taken at
a free Brownian bridge. More precisely, we shall see that, if $\tau$ denotes the joint
law of $(X_0, X_1)$ and $\mu^\tau$ the law of the free Brownian bridge (6.7.3) associated
with the distribution $\tau$ of $(X_0, X_1)$,

$$\limsup_{\delta\to0}\limsup_{N\to\infty}\frac{1}{N^2}\log P\big(d(\hat\mu_1^N,\mu_1) < \delta\big) \le \sup_{\substack{\tau\circ X_0^{-1}=\mu_0\\ \tau\circ X_1^{-1}=\mu_1}}\limsup_{\delta\to0}\limsup_{N\to\infty}\frac{1}{N^2}\log P\Big(\max_{1\le k\le n} d(\hat\mu_{t_k}^N,\mu_{t_k}^\tau) \le \delta\Big)$$

for any family $\{t_1,\dots,t_n\}$ of times in $[0,1]$. Therefore, Theorem 4.5 2) a) implies
that

$$\limsup_{\delta\to0}\limsup_{N\to\infty}\frac{1}{N^2}\log P\big(d(\hat\mu_1^N,\mu_1) < \delta\big) \le -\inf\{S(\mu^\tau);\ \tau\circ X_0^{-1} = \mu_0,\ \tau\circ X_1^{-1} = \mu_1\}.$$

The lower bound estimate obtained in Theorem 4.5 2) b) therefore guarantees
that

$$\inf\{S(\nu);\ \nu_0 = \mu_0,\ \nu_1 = \mu_1\} \ge \inf\{S(\mu^\tau);\ \tau\circ X_0^{-1} = \mu_0,\ \tau\circ X_1^{-1} = \mu_1\},$$

and therefore the equality since the other bound is trivial.

Let us now be more precise. We consider the empirical distribution
$\hat\mu_{0,1}^N = \hat\mu_{X_0^N,X_1^N}$ of the couple of the initial and final matrices of our process as an
element of $\mathcal M_1^{(2)}$, equipped with the topology of the Stieltjes functionals. It is
not hard to see that $\mathcal M_1^{(2)}$ is a compact metric space by the Banach-Alaoglu
theorem. Let $D$ be a distance on $\mathcal M_1^{(2)}$.

Then, for any $\epsilon > 0$, we can find $M \in \mathbb N$ and $(\tau_k)_{1\le k\le M}$ so that
$\mathcal M_1^{(2)} \subset \cup_{1\le k\le M}\{\tau : D(\tau,\tau_k) < \epsilon\}$ and therefore

$$\limsup_{\delta\to0}\limsup_{N\to\infty}\frac{1}{N^2}\log P\big(d(\hat\mu_1^N,\mu_1) < \delta\big) \le \max_{1\le k\le M}\limsup_{\delta\to0}\limsup_{N\to\infty}\frac{1}{N^2}\log P\big(d(\hat\mu_1^N,\mu_1) < \delta;\ D(\hat\mu_{0,1}^N,\tau_k) < \epsilon\big).$$

Now, conditionally on $X_1^N$,

$$dX_t^N = dH_t^N - \frac{X_t^N - X_1^N}{1-t}\,dt$$

or equivalently

$$X_t^N = tX_1^N + (1-t)X_0^N + (1-t)\int_0^t (1-s)^{-1}\,dH_s^N.$$
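The equivalence between the bridge equation and its integrated form can be checked numerically in the scalar case. The following sketch is only an analogy under stated assumptions: it replaces the Hermitian Brownian motion by a real Brownian motion (the case $N = 1$), uses an Euler scheme with hypothetical parameters `n` and `seed`, and evaluates the integrand $(1-s)^{-1}$ at the right endpoint of each step, for which the two discrete recursions coincide exactly.

```python
import random

def simulate_bridge(x0, x1, n=1000, seed=0):
    """Euler scheme for dX_t = dB_t - (X_t - x1) / (1 - t) dt, X_0 = x0,
    compared with X_t = t*x1 + (1-t)*x0 + (1-t) * int_0^t dB_s / (1-s).
    Returns a list of (t, euler value, closed-form value); stops before t = 1."""
    rng = random.Random(seed)
    dt = 1.0 / n
    x_euler = x0
    stoch_int = 0.0  # discretized integral of (1 - s)^{-1} dB_s
    path = []
    for k in range(n - 1):
        t, t_next = k * dt, (k + 1) * dt
        dB = rng.gauss(0.0, dt ** 0.5)
        x_euler += dB - (x_euler - x1) / (1.0 - t) * dt
        stoch_int += dB / (1.0 - t_next)
        x_closed = t_next * x1 + (1.0 - t_next) * x0 + (1.0 - t_next) * stoch_int
        path.append((t_next, x_euler, x_closed))
    return path

path = simulate_bridge(x0=0.0, x1=2.0)
print(max(abs(xe - xc) for _, xe, xc in path))  # agreement up to rounding
print(path[-1])                                 # close to the terminal value x1
```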



It is not hard to see that when $\hat\mu_{0,1}^N$ converges toward $\tau$, $\hat\mu_t^N$ converges toward
$\mu_t^\tau$ for all $t \in [0,1]$.

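Therefore, for any $\eta > 0$, any $\kappa > 0$ and any $t_1,\dots,t_n \in [0,1]$, there exists $\epsilon > 0$ such that
for any $(X_0^N, X_1^N) \in \{D(\hat\mu_{0,1}^N,\tau) < \epsilon\}$ and $N$ large enough,

$$P\Big(\max_{1\le k\le n} d(\hat\mu_{t_k}^N,\mu_{t_k}^\tau) > \eta\ \Big|\ X_1^N\Big) \le \kappa.$$

Hence for any $\eta$, when $\epsilon$ is small enough and $N$ large enough, taking $\kappa = \frac12$,

$$P\big(d(\hat\mu_1^N,\mu_1) < \delta;\ D(\hat\mu_{0,1}^N,\tau) < \epsilon\big) \le 2P\Big(d(\hat\mu_1^N,\mu_1) < \delta;\ D(\hat\mu_{0,1}^N,\tau) < \epsilon,\ \max_{1\le k\le n} d(\hat\mu_{t_k}^N,\mu_{t_k}^\tau) < \eta\Big).$$

We arrive at, for $\epsilon$ small enough and any $\tau \in \mathcal M_1^{(2)}$,

$$\limsup_{N\to\infty}\frac{1}{N^2}\log P\big(d(\hat\mu_1^N,\mu_1) < \delta;\ D(\hat\mu_{0,1}^N,\tau) < \epsilon\big) \le \limsup_{N\to\infty}\frac{1}{N^2}\log P\Big(\max_{1\le k\le n} d(\hat\mu_{t_k}^N,\mu_{t_k}^\tau) \le \eta\Big).$$

Using the large deviation upper bound for the law of $(\hat\mu_t^N, t \in [0,1])$ of Theorem
4.5 2), we deduce

$$\limsup_{N\to\infty}\frac{1}{N^2}\log P\big(d(\hat\mu_1^N,\mu_1) < \delta\big) \le -\min_{1\le p\le M}\ \inf\Big\{S(\nu);\ \max_{1\le k\le n} d(\nu_{t_k},\mu_{t_k}^{\tau_p}) \le \eta\Big\}.$$

We can now let $\epsilon$ go to zero, then $\delta$ go to zero, and then $n$ go to
infinity, to conclude (since $S$ is a good rate function). It is not hard to see that
$\mathrm{FBB}(\mu_0,\mu_1)$ is closed (see p. 565-566 in [61]).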



Chapter 7

Voiculescu's non-commutative entropies

One of the motivations to construct free probability theory, and in
particular its entropy theory, was to study von Neumann algebras and in particular
to prove (or disprove) isomorphisms between them. One of the long standing
questions in this matter is to see whether the free group factor $L(F_m)$ with $m$
generators is isomorphic with the free group factor $L(F_n)$ with $n$ generators when $n \ne m$.
Such a problem can be rephrased in free probability terms due to the remark
that if $X_1,\dots,X_m$ (resp. $Y_1,\dots,Y_m$) are non-commutative variables with law
$\tau_X$ and $\tau_Y$ respectively, then $\tau_X = \tau_Y \Rightarrow W^*(X_1,\dots,X_m) \simeq W^*(Y_1,\dots,Y_m)$.
Therefore,

$$W^*(X_1,\dots,X_m) \simeq W^*(Y_1,\dots,Y_m) \Leftrightarrow \tau_X \equiv \tau_Y$$

where $\tau_X \equiv \tau_Y$ when there exist $F \in L^\infty(\tau_X)$, $G \in L^\infty(\tau_Y)$ such that

$$\tau_Y(P) = F\#\tau_X(P) = \tau_X(P\circ F), \qquad \tau_X(P) = G\#\tau_Y(P), \qquad \forall P \in \mathbb C\langle X_1,\dots,X_m\rangle.$$

Here, $L^\infty(\tau_X)$ and $L^\infty(\tau_Y)$ denote the elements of the von Neumann algebras
obtained by the Gelfand-Naimark-Segal construction. Therefore, if $\sigma^m$ is the
law of $m$ free semi-circular variables $S_1,\dots,S_m$, $L(F_m) \simeq W^*(S_1,\dots,S_m)$ and

$$L(F_m) \simeq L(F_n) \Leftrightarrow W^*(S_1,\dots,S_m) \simeq W^*(S_1,\dots,S_n) \Leftrightarrow \sigma^m \equiv \sigma^n.$$

The isomorphism problem is therefore equivalent to wondering whether $\sigma^m \equiv \sigma^n$
implies $m = n$ or not.

Note that in the classical setting, two probability measures on $\mathbb R^m$ and $\mathbb R^n$
are equivalent (in the sense above that they are the push forward of each other)
provided they have no atoms and regardless of the choice of $m, n \in \mathbb N$. In the
non-commutative context, one may expect that such an isomorphism becomes false
in view for instance of the work of D. Gaboriau [55] in the context of equivalence
relations. However, such a setting carries more structure due to the equivalence
relations than free group factors; it concerns the case of algebras with Cartan
sub-algebras (which are roughly Abelian sub-algebras) whereas D. Voiculescu
[124] proved that the free group factors (which are separable $II_1$-factors) have no
Cartan sub-algebras.

To try to disprove the isomorphism, one would like to construct a function
$\delta : \mathcal M^{(m)} \to \mathbb R$ which is an invariant in the sense that for any $\tau, \tilde\tau \in \mathcal M^{(m)}$

$$\tau \equiv \tilde\tau \Rightarrow \delta(\tau) = \delta(\tilde\tau) \qquad (7.0.1)$$

and such that $\delta(\sigma^k) = k$ (where $\sigma^k$ can be embedded into $\mathcal M^{(m)}$ for
$k \le m$ by taking $m - k$ null operators). D. Voiculescu proposed as a
candidate the so-called entropy dimension $\delta$, constructed in the spirit of Minkowski
dimension (see (7.1.3) for a definition). It is currently under study whether $\delta$
is an invariant of the von Neumann algebra, that is if it satisfies (7.0.1). We
note however that the converse implication is false since N. Brown just
produced an example showing that there exists a von Neumann algebra which
is not isomorphic to the free group factor but has the same entropy dimension.
Even though this problem has not yet been solved, some other important
questions concerning von Neumann algebras could already be answered by using this
approach (see [124] for instance). In fact, free entropy theory is still far from
being complete and the understanding of these objects is still too narrow to be
applied to its full extent. In this last chapter, we will try to complement this
theory by means of large deviations techniques.

We begin with the definitions of the two main entropies introduced by D.
Voiculescu, namely the microstates entropy $\chi$ and the microstates-free entropy
$\chi^*$. The entropy dimension $\delta$ is defined via the microstates entropy $\chi$ but we
shall not study it here.

7.1. Definitions 

We define here a version of Voiculescu's microstates entropy with respect to
the Gaussian measure (rather than Lebesgue measure as he initially did: these
two points of view are equivalent (see [30]) but the Gaussian point of view is
more natural after the previous chapters). I underlined the entropies to mark
this difference, but otherwise kept the same notations as Voiculescu.

For $\mu \in \mathcal M^{(m)}$, $n \in \mathbb N$, $N \in \mathbb N$, $\epsilon > 0$, Voiculescu defines a neighborhood
$\Gamma_R(\mu, n, N, \epsilon)$ of the state $\mu$ as the set of matrices $A_1,\dots,A_m$ of $\mathcal H_N^m$ such that

$$|\mu(X_{i_1}\cdots X_{i_p}) - \mathrm{tr}(A_{i_1}\cdots A_{i_p})| < \epsilon$$

for any $1 \le p \le n$, $(i_1,\dots,i_p) \in \{1,\dots,m\}^p$ and $\|A_j\|_\infty \le R$. Then, the microstates
entropy w.r.t. the Gaussian measure is given by

$$\underline{\chi}(\mu) := \sup_{R>0}\ \inf_{n\in\mathbb N}\ \inf_{\epsilon>0}\ \limsup_{N\to\infty} N^{-2}\log\mu_N^{\otimes m}\big[\Gamma_R(\mu, n, N, \epsilon)\big].$$

This definition of the entropy is done in the spirit of Boltzmann and Shannon.
In the classical case where $\mathcal M^{(m)}$ is replaced by $\mathcal P(\mathbb R)$, the entropy with respect
to the Gaussian measure $\gamma$ is the relative entropy

$$S(\mu) := \inf_{\epsilon>0}\limsup_{N\to\infty} N^{-1}\log\gamma^{\otimes N}\Big(d\Big(\frac1N\sum_{i=1}^N\delta_{x_i},\mu\Big) < \epsilon\Big)$$

$$= \inf_{\epsilon>0,\,k\in\mathbb N}\limsup_{N\to\infty} N^{-1}\log\gamma^{\otimes N}\Big(\Big|\frac1N\sum_{i=1}^N f_p(x_i) - \mu(f_p)\Big| < \epsilon,\ p \le k\Big)$$

where the last equality holds if $(f_p)_{p\in\mathbb N}$ is a family of uniformly continuous
functions, dense in $\mathcal C_b^0(\mathbb R)$. This last way to define the entropy is very close to
Voiculescu's definition. In the commutative case, we know by Sanov's theorem
that

1. The limsup in the definition of $S$ can be replaced by a liminf, i.e.

$$S(\mu) = \inf_{\epsilon>0}\liminf_{N\to\infty} N^{-1}\log\gamma^{\otimes N}\Big(d\Big(\frac1N\sum_{i=1}^N\delta_{x_i},\mu\Big) < \epsilon\Big).$$

2. For any $\mu \in \mathcal P(\mathbb R)$ we have the formula

$$S(\mu) = -S^*(\mu)$$

with $S^*$ the relative entropy, which is infinite if $\mu$ is not absolutely
continuous w.r.t. $\gamma$ and otherwise given by

$$S^*(\mu) = \int\frac{d\mu}{d\gamma}(x)\log\frac{d\mu}{d\gamma}(x)\,d\gamma(x).$$

These two fundamental results are still lacking in the non-commutative theory
except in the case $m = 1$ where Voiculescu [123] (see also ^U]) has shown that
the two limits coincide and that

$$\chi(\mu) = \Sigma(\mu) - \frac12\int x^2\,d\mu(x) + \text{constant}$$

with

$$\Sigma(\mu) = \int\!\!\int\log|x - y|\,d\mu(x)\,d\mu(y).$$



In the case where $m \ge 2$, Voiculescu [122, 125] proposed an analogue of $S^*$,
which does not depend at all on the definition of microstates and is called for that
reason the microstates-free entropy. The definition of $\chi^*$ is based on the notion
of free Fisher information. To generalize the definition of Fisher information to
the non-commutative setting, D. Voiculescu noticed that the standard Fisher
information can be defined as

$$F(\mu) := \|\partial_x^*1\|^2_{L^2(\mu)}$$

with $\partial_x^*$ the adjoint of the derivative $\partial_x$ for the scalar product in $L^2(\mu)$, that is
that for every $f \in L^2(\mu)$,

$$\int f\,\partial_x^*1\,d\mu(x) = \int\partial_x f\,d\mu(x).$$

When $\mu$ has a density $\rho$ with respect to Lebesgue measure, we simply have
$\partial_x^*1 = -\partial_x\log\rho = -\rho^{-1}\partial_x\rho$. The entropy $S^*$ is related to Fisher information
by the formula

$$S^*(\mu) = \frac12\int_0^1 F(\mu_t^b)\,dt$$

with $\mu_t^b$ the law at time $t$ of the Brownian bridge between $\delta_0$ and $\mu$.

Such a definition can be naturally extended to the free probability setting
as follows. To this end, we begin by describing the definition of the derivative
in this setting; let $P \in \mathbb C\langle X_1,\dots,X_m\rangle$ be for instance a monomial function
$\prod_{1\le k\le r}X_{i_k}$. Then, for any operators $X_1,\dots,X_m$ and $Y_1,\dots,Y_m$,

$$P(X + \epsilon Y) - P(X) = \epsilon\sum_{k=1}^r\ \prod_{1\le j\le k-1}X_{i_j}\ Y_{i_k}\prod_{k+1\le l\le r}X_{i_l} + O(\epsilon^2). \qquad (7.1.2)$$

In order to keep track of the place where the operator $Y$ has to be inserted, the
derivative is defined as follows: the derivative $D_{X_i}$ with respect to the $i$-th
variable is a linear map from $\mathbb C\langle X_1,\dots,X_m\rangle$ into $\mathbb C\langle X_1,\dots,X_m\rangle\otimes\mathbb C\langle X_1,\dots,X_m\rangle$
satisfying the non-commutative Leibniz rule

$$D_{X_i}PQ = D_{X_i}P\times 1\otimes Q + P\otimes 1\times D_{X_i}Q$$

for any $P, Q \in \mathbb C\langle X_1,\dots,X_m\rangle$ and

$$D_{X_i}X_l = 1_{l=i}\,1\otimes1.$$

One then denotes by $\sharp$ the map from $\mathbb C\langle X_1,\dots,X_m\rangle\otimes\mathbb C\langle X_1,\dots,X_m\rangle\times\mathbb C\langle X_1,\dots,X_m\rangle$
into $\mathbb C\langle X_1,\dots,X_m\rangle$ such that $A\otimes B\,\sharp\,C = ACB$. It is then not
hard to see that (7.1.2) reduces to

$$P(X + \epsilon Y) - P(X) = \epsilon\sum_{j=1}^m D_{X_j}P\,\sharp\,Y_j + O(\epsilon^2).$$

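The cyclic derivative $\mathcal D_{X_i}$ with respect to the $i$-th variable is given by

$$\mathcal D_{X_i} = m\circ D_{X_i}$$

where $m : \mathbb C\langle X_1,\dots,X_m\rangle\otimes\mathbb C\langle X_1,\dots,X_m\rangle \to \mathbb C\langle X_1,\dots,X_m\rangle$ is such that
$m(P\otimes Q) = QP$. In the case $m = 1$, using the bijection between $\mathbb C\langle X\rangle\otimes\mathbb C\langle X\rangle$ and
$\mathbb C\langle X, Y\rangle$, we find that

$$D_XP(x, y) = \frac{P(x) - P(y)}{x - y}.$$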
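On monomials, both derivatives reduce to simple operations on words, which the following sketch makes explicit. The encoding is an assumption made for illustration: a monomial $X_{i_1}\cdots X_{i_r}$ is stored as the tuple `(i_1, ..., i_r)`, an elementary tensor $A\otimes B$ of monomials as the pair `(A, B)`, and a sum as a list; the function names are hypothetical.

```python
def free_difference_quotient(word, i):
    """D_{X_i} of a monomial: one term (left, right), standing for left (x) right,
    for each occurrence of X_i in the word."""
    return [(word[:k], word[k + 1:])
            for k, letter in enumerate(word) if letter == i]

def cyclic_derivative(word, i):
    """m o D_{X_i}, with m(P (x) Q) = QP: each term becomes right + left."""
    return [right + left for left, right in free_difference_quotient(word, i)]

# P = X1 X2 X1: D_{X1} P = 1 (x) X2 X1 + X1 X2 (x) 1.
print(free_difference_quotient((1, 2, 1), 1))
# Cyclic derivative: X2 X1 + X1 X2.
print(cyclic_derivative((1, 2, 1), 1))

def evaluate_one_variable(terms, x, y):
    """For a single variable, read left (x) right as x^len(left) * y^len(right)."""
    return sum(x ** len(left) * y ** len(right) for left, right in terms)

# Single-variable check with P = X^3 at (x, y) = (3, 2):
x, y = 3.0, 2.0
print(evaluate_one_variable(free_difference_quotient((1, 1, 1), 1), x, y),
      (x ** 3 - y ** 3) / (x - y))
```

In the single-variable case the output reproduces the difference quotient $(P(x) - P(y))/(x - y)$, here $x^2 + xy + y^2$ for $P = X^3$.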
The analogue of $\partial_x^*1$ in $L^2(\mu)$ is given, for any $\tau \in \mathcal M^{(m)}$, as the element $D_{X_i}^*1$ in
$L^2(\tau)$ such that for any $P \in \mathbb C\langle X_1,\dots,X_m\rangle$,

$$\tau\otimes\tau(D_{X_i}P) = \tau(P\,D_{X_i}^*1).$$

In the case $m = 1$ and so $\tau \in \mathcal P(\mathbb R)$, it is not hard to check (see [123]) that

$$D_X^*1(x) = 2\,PV\Big(\int(x - y)^{-1}\,d\tau(y)\Big).$$

Free Fisher information is thus given, for $\tau \in \mathcal M^{(m)}$, by

$$\Phi^*(\tau) = \sum_{i=1}^m\|D_{X_i}^*1\|^2_{L^2(\tau)}$$

and therefore $\chi^*$ is defined by

$$\chi^*(\mu) := -\frac12\int_0^1\Phi^*(\mu_t^b)\,dt$$

with $\mu_t^b$ the distribution of the free Brownian bridge, $\mu_t^b = \mathcal L(tX_i + \sqrt{t(1-t)}\,S_i,\ 1 \le i \le m)$
if $\mu = \mathcal L(X_i, 1 \le i \le m)$ and $S_i$ are free standard semi-circular variables,
free with $(X_i, 1 \le i \le m)$.

The conjecture (named 'unification problem' by D. Voiculescu [129]) is

Conjecture 7.1: For any $\mu$ such that $\chi(\mu) > -\infty$,

$$\chi(\mu) = \chi^*(\mu).$$

Remark 7.2: If the conjecture held for any $\mu \in \mathcal M^{(m)}$ and not only for $\mu$
with finite microstates entropy, it would provide an affirmative answer to Connes'
question since it is known that if $\mu$ is the law of $X_1,\dots,X_m$ and $S_1,\dots,S_m$ are free
semicircular variables, free with $X_1,\dots,X_m$, then, for any $\epsilon > 0$, the distribution
$\mu\boxplus\sigma_\epsilon$ of $(X_1 + \epsilon S_1,\dots,X_m + \epsilon S_m)$ satisfies $\chi^*(\mu\boxplus\sigma_\epsilon) > -\infty$, and hence the
above equality would imply $\chi(\mu\boxplus\sigma_\epsilon) > -\infty$ so that one could find matrices
whose empirical distribution approximates $\mu\boxplus\sigma_\epsilon$ and thus $\mu$, since $\epsilon$ can be
chosen arbitrarily small.

In ^2], we proved that

Theorem 7.3. For any $\mu \in \mathcal M^{(m)}$,

$$\chi(\mu) \le \chi^*(\mu).$$

Moreover, we can define another entropy $\chi^{**}$ such that

$$\chi^{**}(\mu) \le \chi(\mu).$$

Typically, $\chi^*$ is obtained as an infimum of a rate function on laws of
non-commutative processes with given terminal data, whereas $\chi^{**}$ is the infimum of
the same rate function but on an a priori smaller set.

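From this result, we also obtain bounds on the entropy dimension.

Corollary 7.4. Let for $\tau \in \mathcal M^{(m)}$,

$$\delta(\tau) := m - \limsup_{\epsilon\to0}\frac{\underline{\chi}(\tau\boxplus\sigma_\epsilon)}{\log\epsilon} = m - \limsup_{\epsilon\to0}\frac{\chi(\tau\boxplus\sigma_\epsilon)}{\log\epsilon} \qquad (7.1.3)$$

where $\tau\boxplus\sigma_\epsilon$ stands for the free convolution by $m$ free semi-circular variables
with parameter $\epsilon > 0$. Define accordingly $\delta^*$, $\delta^{**}$. Then

$$\delta^{**}(\tau) \le \delta(\tau) \le \delta^*(\tau).$$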

In a recent work, A. Connes and D. Shlyakhtenko defined another
quantity $\Delta$, candidate to be an invariant for von Neumann algebras, by generalizing
the notion of $L^2$-homology and $L^2$-Betti numbers for a tracial von Neumann
algebra. Such a definition is in particular motivated by the work of D.
Gaboriau [55]. They could compare $\Delta$ with $\delta^*$, and therefore, thanks to the above
corollary, to $\delta$.

Even though $\delta^*$, $\delta^{**}$ are not simple objects, they can be computed in some
cases such as for the law of the DT-operators or in the case of a finitely
generated group where I. Mineyev and D. Shlyakhtenko [95] proved that

$$\delta^*(\tau) = \beta_1(G) - \beta_0(G) + 1$$

with the group $L^2$ Betti-numbers $\beta$.

DT-operators have long been an interesting candidate to try to disprove the
invariance of $\delta$. A DT-operator $T$ can be constructed as the limit in
distribution of upper triangular matrices with i.i.d. Gaussian entries (which amounts
to considering the law of two self-adjoint non-commutative variables $T + T^*$ and
$i(T - T^*)$). If $C$ is a circular operator, which is the limit in distribution of
the (non-Hermitian) matrix with i.i.d. Gaussian entries, then $C$ can be written
as $T + \tilde T^*$ where $T, \tilde T$ are free copies of $T$. Hence, since $\delta(C) \le 2$, we can hope
that $\delta(T) < 2$. However, based on a heavy computation of moments of these
DT-operators due to P. Sniady [111], K. Dykema and U. Haagerup [44] could
prove that $T$ generates $L(F_2)$. Hence invariance would be disproved if $\delta(T) < 2$.
But in fact, L. Aagaard [1] recently proved that $\delta^*(T) = 2$, which shows at least
that $T$ is not a counter-example for the invariance of $\delta^*$ (and also settles the
case for $\delta$ if one believes Conjecture 7.1).

We now give the main ideas of the proof of Theorem 7.3.

7.2. Large deviation upper bound for the law of the process of the 
empirical distribution of Hermitian Brownian motions 



In PHI and jSU], we established with T. Cabanal Duvillard the inequality

Theorem 7.5. There exists a good rate function $S$ on $\mathcal C([0,1], \mathcal M^{(m)})$ such
that for any $\mu \in \mathcal M^{(m)}$

$$\chi(\mu) \le -S(\mu^b) \qquad (7.2.4)$$

and

$$-S(\mu^b) \ge \chi^*(\mu). \qquad (7.2.5)$$

Inequality (7.2.5) is an equality as soon as $(D_{X_1}^*1,\dots,D_{X_m}^*1)$ is in the cyclic
gradient space, i.e.

$$\inf_{F\in\mathcal C^1([0,1],\,\mathcal{CC}^m_{st}(\mathbb R))}\ \cdots$$

where the infimum runs over the set $\mathcal C^1([0,1], \mathcal{CC}^m_{st}(\mathbb R))$ of continuously
differentiable functions with values in $\mathcal{CC}^m_{st}(\mathbb R)$, the restriction to self-adjoint
non-commutative functions of $\mathcal{CC}^m_{st}(\mathbb C)$ defined in (6.2.2). Further, by definition, if
$m : \mathbb C\langle X_1,\dots,X_m\rangle\otimes\mathbb C\langle X_1,\dots,X_m\rangle \to \mathbb C\langle X_1,\dots,X_m\rangle$ is such that $m(P\otimes Q) = QP$,
the cyclic derivative is given by $\mathcal D_{X_i} = m\circ D_{X_i}$.

Note here that it is not clear whether $(D_{X_1}^*1,\dots,D_{X_m}^*1)$ should belong to
the cyclic gradient space or not in general. This was proved by D. Voiculescu
[126] when it is polynomial, and it seems quite natural that this should be the
case for states with finite entropy (see a discussion in |3Up.

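The strategy here is exactly the same as in the earlier chapter: consider $m$ independent
Hermitian Brownian motions $(H^{N,1}, \ldots, H^{N,m})$ and the $\mathcal{M}^{(m)}$-valued process
$\mu^N_t = \hat\mu^N_{H^{N,1}(t), \ldots, H^{N,m}(t)}$. Then, it can be seen thanks to Itô's calculus that, for any
$F \in \mathcal{C}^1([0,1], \mathcal{CC}^m_{st}(\mathbb{R}))$, if we set

$S^{s,t}(\nu, F) = \nu_t(F_t) - \nu_s(F_s) - \int_s^t \nu_u(\partial_u F_u)\,du - \frac{1}{2} \int_s^t \sum_{i=1}^m \nu_u \otimes \nu_u (D_{X_i} \mathcal{D}_{X_i} F_u)\,du$

$\langle\langle F, G \rangle\rangle^{s,t}_{\nu} = \sum_{l=1}^m \int_s^t \nu_u(\mathcal{D}_{X_l} F_u \, \mathcal{D}_{X_l} G_u)\,du$

$S^{s,t}(\nu) = \sup_{F \in \mathcal{CC}^m_{st}(\mathbb{R} \times [0,1])} \big( S^{s,t}(\nu, F) - \frac{1}{2} \langle\langle F, F \rangle\rangle^{s,t}_{\nu} \big)$,

then $(S^{0,t}(\mu^N, F), t \ge 0)$ is a martingale with bracket given by $N^{-2} \langle\langle F, F \rangle\rangle^{0,t}_{\mu^N}$.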
Hence we are in business and we can prove a large deviation upper bound with a
good rate function which is infinite if the initial law is not the law of null
operators, and otherwise given by $S^{0,1}$. We can proceed as in Chapter 6 to improve
the upper bound when we are considering the deviations of $\mu^N_1$ and see that the
infimum is taken at a free Brownian bridge. The relation with $\chi^*$ comes from
the fact that the free Brownian bridge is associated with the field
where $\mathcal{J}^{\mu} = D^*1$ denotes the non-commutative Hilbert transform of $\mu \in \mathcal{M}^{(m)}$
and $\mu^b_t = \mu^b \circ \pi_t$ is the time marginal of the free Brownian bridge. This equality
is a direct consequence of Corollary 3.9 in [124].

The main obstacle to obtaining a large deviation lower bound here is
that the associated Fokker-Planck equations are quite hard to study and we
could not find a reasonable condition over the fields to obtain uniqueness, as
needed (see Section 4.2.1).

This is the reason why the idea to study large deviation on path space 
emerged; the lower bound estimates become easier since we shall then deal with 
uniqueness of strong solutions to these Fokker-Planck equations rather than 
uniqueness of weak solutions. 

7.3. Large deviations estimates for the law of the empirical 
distribution on path space of the Hermitian Brownian motion 

In this section, we consider the empirical distribution on path space of independent
Hermitian Brownian motions $(H^{N,1}, \ldots, H^{N,m})$. It is described by the quantities

$\sigma^N(F) = \mathrm{tr}\big(P(H^{N,i_1}(t_1), H^{N,i_2}(t_2), \ldots, H^{N,i_n}(t_n))\big)$

with $F(x^1, \ldots, x^m) = P(x^{i_1}(t_1), \ldots, x^{i_n}(t_n))$ for any choice of non-commutative
test function $P$, any $(t_1, \ldots, t_n) \in [0,1]^n$ and $(i_1, \ldots, i_n) \in \{1, \ldots, m\}^n$.

The study of the deviations of the law of a N could a priori be performed 
again thanks to Ito's formula by induction over the number of time marginals. 
However, this is not a good idea here since we could not obtain the lower bound 
estimates. Hence, we produced a new idea to generate exponential martingales 
which is based on the Clark-Ocone formula. 

7.4. Statement of the results 

Let us be more precise. In order to avoid the problems of unboundedness of
the operators and still get a nice topology, manageable for free probabilists, we
considered in [18] the unitary operators

$U^{N,l}_t := \psi(H^{N,l}_t)$, with $\psi(x) = (x + 4i)(x - 4i)^{-1}$.

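If

$\Omega^l_t := (S^l_t + 4i)(S^l_t - 4i)^{-1}$

with a free Brownian motion $(S^1, \ldots, S^m)$, it is not hard to see that the empirical
distribution of $(U^{N,1}, \ldots, U^{N,m})$ converges to the law of $(\Omega^1, \ldots, \Omega^m)$,
and we shall here study the deviations with respect to this typical behavior.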
Since the $U^{N,l}$ are uniformly bounded, polynomial test functions provide a
good topology. This amounts to restricting ourselves to a few Stieltjes functionals
of the Hermitian Brownian motions. However, this is already enough to study
Voiculescu's entropy when one considers the deviations toward laws of bounded
operators, since then the polynomial functions of $(\psi(X_1), \ldots, \psi(X_m))$ generate
the set of polynomial functions of $(X_1, \ldots, X_m)$ and vice versa (see Lemma 7.7).
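The role of the transform $\psi$ can be made concrete with a few elementary identities. The block below is a routine computation added for illustration, not taken from the source: it records why $\psi$ sends self-adjoint operators to unitaries, why polynomials in $\psi(x)$ are resolvent (Stieltjes-type) functionals of $x$, and how $x$ is recovered from $u = \psi(x)$ when $x$ is bounded.

```latex
\begin{align*}
% x self-adjoint: x - 4i is invertible and commutes with x + 4i
\psi(x) &= (x+4i)(x-4i)^{-1}, &
\psi(x)^* &= (x+4i)^{-1}(x-4i) = \psi(x)^{-1}, \\
% resolvent form: polynomials in u, u^{-1} are polynomials in resolvents of x
\psi(x) - 1 &= 8i\,(x-4i)^{-1}, &
\psi(x)^{-1} - 1 &= -8i\,(x+4i)^{-1}, \\
% inversion: u(x - 4i) = x + 4i gives (u - 1)x = 4i(u + 1)
x &= 4i\,(u+1)(u-1)^{-1}, &
(u-1)^{-1} &= \tfrac{1}{8i}\,(x-4i).
\end{align*}
```

The last identity shows that $u - 1$ has a bounded inverse exactly when $x$ is bounded, which is consistent with the restriction to laws of bounded operators made in the text.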

Let $\mathcal{F}^m_{[0,1]}$ be the *-algebra of the free group generated by $(u^i_t; t \in [0,1], i \in \{1, \ldots, m\})$
(that is the set of polynomial functions generated by $\prod_k (u^{i_k}_{t_k})^{\epsilon_k}$,
$t_k \in [0,1]$, $i_k \in \{1, \ldots, m\}$, $\epsilon_k = +1, -1$, equipped with the involution
$(u^i_t)^* = (u^i_t)^{-1}$) and $\mathcal{F}^{m,sa}_{[0,1]}$ be its self-adjoint elements.

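We denote by $\mathcal{M}(\mathcal{F}^m_{[0,1]})$ the set of tracial states on $\mathcal{F}^m_{[0,1]}$ (i.e. the set of linear
forms on $\mathcal{F}^m_{[0,1]}$ satisfying the properties of positiveness, total mass equal to
one, and traciality, and with real restriction to $\mathcal{F}^{m,sa}_{[0,1]}$). We equip $\mathcal{M}(\mathcal{F}^m_{[0,1]})$
with its weak topology with respect to $\mathcal{F}^m_{[0,1]}$. Let $\mathcal{M}^c(\mathcal{F}^m_{[0,1]})$ be the subset of
$\mathcal{M}(\mathcal{F}^m_{[0,1]})$ of states $\tau$ such that for any $(\epsilon_1, \ldots, \epsilon_n) \in \{-1, +1\}^n$ and any
$(i_1, \ldots, i_n) \in \{1, \ldots, m\}^n$, the quantity $\tau((u^{i_1}_{t_1})^{\epsilon_1} \cdots (u^{i_n}_{t_n})^{\epsilon_n})$ is continuous in the variables
$t_1, \ldots, t_n$. Remark that for any process of unitary operators $(U^1, \ldots, U^m)$ with
values in a $W^*$-probability space $(\mathcal{A}, \varphi)$ we can associate $\tau \in \mathcal{M}(\mathcal{F}^m_{[0,1]})$ such
that

$\tau(P) = \varphi(P(U))$.

Reciprocally, the GNS construction allows us to associate to any $\tau \in \mathcal{M}(\mathcal{F}^m_{[0,1]})$ a
$W^*$-probability space $(\mathcal{A}, \varphi)$ as above.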

In particular, if $S = (S^1, \ldots, S^m)$ is an $m$-dimensional free Brownian motion,
the family $(\psi(S^l_t); t \in [0,1], l \in \{1, \ldots, m\})$ defines a state $\sigma \in \mathcal{M}^c(\mathcal{F}^m_{[0,1]})$. To
define our rate function, we need to introduce, for any time $t \in [0,1]$, any process
$X = (X^i_t; t \in [0,1], i \in \{1, \ldots, m\})$ of self-adjoint operators and an $m$-dimensional
free Brownian motion $S = (S^1, \ldots, S^m)$, $X$ and $S$ being free, the process
$X^t_\cdot = X_{\cdot \wedge t} + S_{(\cdot - t) \vee 0}$. If $\tau \in \mathcal{M}^c(\mathcal{F}^m_{[0,1]})$ is the law of $(\psi(X^i_t); t \in [0,1], i \in \{1, \ldots, m\})$,
we set $\tau^t \in \mathcal{M}^c(\mathcal{F}^m_{[0,1]})$ the distribution of $(\psi(X^{t,i}_s); s \in [0,1], i \in \{1, \ldots, m\})$.
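Finally, we denote by $\nabla_s = (\nabla^l_s)_{1 \le l \le m}$ the non-commutative Malliavin operator given by

$\nabla^l_s\big((u^{i_1}_{t_1})^{\epsilon_1} \cdots (u^{i_n}_{t_n})^{\epsilon_n}\big) = -\frac{1}{8i} \sum_{p:\, i_p = l} \epsilon_p \big((u^{i_p}_{t_p})^{\epsilon_p} - 1\big) (u^{i_{p+1}}_{t_{p+1}})^{\epsilon_{p+1}} \cdots (u^{i_n}_{t_n})^{\epsilon_n}$
$\qquad \times\, (u^{i_1}_{t_1})^{\epsilon_1} \cdots (u^{i_{p-1}}_{t_{p-1}})^{\epsilon_{p-1}} \big((u^{i_p}_{t_p})^{\epsilon_p} - 1\big) 1_{[0,t_p]}(s)$.

Note that this definition is a formal extension of

$\nabla^l_s(x^{i_1}_{t_1} \cdots x^{i_n}_{t_n}) = \sum_{p:\, i_p = l} x^{i_{p+1}}_{t_{p+1}} \cdots x^{i_n}_{t_n} x^{i_1}_{t_1} \cdots x^{i_{p-1}}_{t_{p-1}} 1_{[0,t_p]}(s)$

to the (non converging in general) series $u^j_t = -(1/4i)(x^j_t + 4i) \sum_k (x^j_t/4i)^k$,
$j \in \{1, \ldots, m\}$. Finally, we denote by $\mathcal{B}_t$ the $\sigma$-algebra generated by $\{X^l_u, u \le t, 1 \le l \le m\}$
and, for any $\tau \in \mathcal{M}(\mathcal{F}^m_{[0,1]})$, by $\tau(\cdot|\mathcal{B}_t)$ the conditional expectation
knowing $\mathcal{B}_t$, i.e. the projection on $\mathcal{B}_t$ in $L^2(\tau)$. We proved that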
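Theorem 7.6. Let $I : \mathcal{M}(\mathcal{F}^m_{[0,1]}) \to \mathbb{R}^+$ be given by

$I(\tau) = \sup_{t \in [0,1]} \sup_{F \in \mathcal{F}^{m,sa}_{[0,1]}} \Big\{ \tau^t(F) - \tau^0(F) - \frac{1}{2} \int_0^t \tau\big(\tau^s(\nabla_s F|\mathcal{B}_s)^2\big)\,ds \Big\}$.

Then

1. $I$ is a good rate function.

2. Any $\tau \in \{I < \infty\}$ is such that there exists $K^\tau \in L^2(\tau \times ds)$ such that

(a) $\inf_{P \in \mathcal{F}^{m,sa}_{[0,1]}} \int_0^1 \tau\big(|\tau^s(\nabla_s P|\mathcal{B}_s) - K^\tau_s|^2\big)\,ds = 0$.

(b) For any $P \in \mathcal{F}^m_{[0,1]}$ and $t \in [0,1]$, we have

$\tau^t(P) = \tau^0(P) + \int_0^t \tau\big(\tau^s(\nabla_s P|\mathcal{B}_s) \cdot K^\tau_s\big)\,ds$.

3. For any closed set $F \subset \mathcal{M}^c(\mathcal{F}^m_{[0,1]})$ we have

$\limsup_{N \to \infty} \frac{1}{N^2} \log P(\sigma^N \in F) \le -\inf_{\tau \in F} I(\tau)$.

4. Let $\mathcal{M}^{c,\infty}(\mathcal{F}^m_{[0,1]})$ be the set of states in $\{I < \infty\}$ such that the infimum
in 2.(a) is attained. Then, for any open set $O \subset \mathcal{M}^c(\mathcal{F}^m_{[0,1]})$,

$\liminf_{N \to \infty} \frac{1}{N^2} \log P(\sigma^N \in O) \ge -\inf_{\tau \in O \cap \mathcal{M}^{c,\infty}(\mathcal{F}^m_{[0,1]})} I(\tau)$.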
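The statement that the Malliavin operator on words in the unitaries is a formal extension of the polynomial cyclic formula can be checked by differentiating $\psi(x) = (x+4i)(x-4i)^{-1}$. The computation below is a sketch added for illustration; the prefactor it produces follows from this calculation alone and should be compared with the normalization of the original reference before being relied upon.

```latex
\begin{align*}
% resolvent identities for u = \psi(x)
u - 1 &= 8i\,(x-4i)^{-1}, \qquad u^{-1} - 1 = -8i\,(x+4i)^{-1}, \\
% derivative in a direction h, using d(x-z)^{-1}[h] = -(x-z)^{-1} h (x-z)^{-1}
d u[h] &= -8i\,(x-4i)^{-1} h\,(x-4i)^{-1} = -\tfrac{1}{8i}\,(u-1)\,h\,(u-1), \\
d u^{-1}[h] &= 8i\,(x+4i)^{-1} h\,(x+4i)^{-1} = +\tfrac{1}{8i}\,(u^{-1}-1)\,h\,(u^{-1}-1), \\
% both cases at once, for \epsilon \in \{-1,+1\}
d u^{\epsilon}[h] &= -\tfrac{\epsilon}{8i}\,(u^{\epsilon}-1)\,h\,(u^{\epsilon}-1).
\end{align*}
```

Inserting this differential at the $p$-th letter of a word and moving $h$ to the front by traciality gives the cyclically rotated word flanked by two factors $(u^{\epsilon_p} - 1)$, which is the structure of the unitary formula; the indicator $1_{[0,t_p]}(s)$ is inherited unchanged from the polynomial case.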

7.4.1. Application to Voiculescu's entropies

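To relate Theorem 7.6 to estimates on $\chi$, let us introduce, in the spirit of Voiculescu,
the entropy $\chi^U$ as follows. Let $\Gamma^U_R(\nu, n, N, \epsilon)$ be the set of unitary matrices
$V_1, \ldots, V_m$ such that

$-1 \le 2^{-1}(V_j + V_j^*) \le 1 - 2(R^2 + 1)^{-1}$

for all $j \in \{1, \ldots, m\}$ and

$|\nu(U^{\epsilon_1}_{i_1} \cdots U^{\epsilon_p}_{i_p}) - \mathrm{tr}(V^{\epsilon_1}_{i_1} \cdots V^{\epsilon_p}_{i_p})| < \epsilon$

for any $1 \le p \le n$, $(i_1, \ldots, i_p) \in \{1, \ldots, m\}^p$, $(\epsilon_1, \ldots, \epsilon_p) \in \{-1, +1\}^p$. We set

$\chi^U(\nu) := \sup_{R > 0} \inf_{n \in \mathbb{N}} \inf_{\epsilon > 0} \limsup_{N \to \infty} \frac{1}{N^2} \log P\big((\psi(A^N_1), \ldots, \psi(A^N_m)) \in \Gamma^U_R(\nu, n, N, \epsilon)\big)$.

Let $\Psi$ be the Möbius function

$\Psi(X_1, \ldots, X_m)_l = \psi(X_l) = \frac{X_l + 4i}{X_l - 4i}$.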

Then, it is not hard to see that 
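Lemma 7.7. For any $\tau \in \mathcal{M}^{(m)}$ which corresponds, by the GNS construction,
to the distribution of bounded operators, we have

$\chi(\tau) = \chi^U(\tau \circ \Psi)$.

Moreover, Theorem 7.6 and the contraction principle imply

Theorem 7.8. Let $\pi_1 : \mathcal{M}^c(\mathcal{F}^m_{[0,1]}) \to \mathcal{M}(\mathcal{F}^m_{\{1\}}) = \mathcal{M}^{(m)}_U$ be the projection on
the algebra generated by $(U^1(1), \ldots, U^m(1), U^1(1)^{-1}, \ldots, U^m(1)^{-1})$. Then, for
any $\tau \in \mathcal{M}^{(m)}$,

$\chi^{**}(\tau) := -\lim_{\delta \to 0} \inf\{I(\sigma);\ d(\sigma \circ \pi_1, \tau \circ \Psi) < \delta,\ \sigma \in \mathcal{M}^{c,\infty}(\mathcal{F}^m_{[0,1]})\} \le \chi(\tau)$ (7.4.6)

and

$\chi(\tau) \le -\lim_{\delta \to 0} \inf\{I(\sigma);\ d(\sigma \circ \pi_1, \tau \circ \Psi) < \delta,\ \sigma \in \mathcal{M}^c(\mathcal{F}^m_{[0,1]})\}$
$\qquad = -\inf\{I(\sigma);\ \sigma \circ \pi_1 = \tau \circ \Psi\}$. (7.4.7)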

The above upper bound can be improved by realizing that the infimum has
to be achieved at a free Brownian bridge (generalizing the ideas of Chapter 6,
Section 6.7). In fact, if $\mu$ is the distribution of $m$ self-adjoint operators
$\{X_1, \ldots, X_m\}$ and $\{S^1, \ldots, S^m\}$ is a free Brownian motion, free with
$\{X_1, \ldots, X_m\}$, we denote by $\mu^b$ the distribution of

$\{\psi(tX_l + (1 - t)S^l_{t/(1-t)}),\ 1 \le l \le m,\ t \in [0,1]\}$. Then

Theorem 7.9. For any $\mu \in \mathcal{M}^{(m)}$,

$\chi(\mu) \le -I(\mu^b) = \chi^*(\mu)$. (7.4.8)

7.4.3. Proof of Theorem 7.6

We do not prove here that $I$ is a good rate function. The idea of the proof of the
large deviations estimates is again based on the construction of exponential
martingales. In fact, for $P \in \mathcal{F}^m_{[0,1]}$, it is clear that $M^N_P(t) = E[\sigma^N(P)|\mathcal{F}_t] - E[\sigma^N(P)]$
is a martingale for the filtration $\mathcal{F}_t$ of the Hermitian Brownian motions. The
Clark-Ocone formula gives us the bracket of this martingale:

$\langle M^N_P \rangle_t = \frac{1}{N^2} \sum_{l=1}^m \int_0^t \mathrm{tr}\big(E[\nabla^l_s P|\mathcal{F}_s]^2\big)\,ds$.

As a consequence, for any $P \in \mathcal{F}^{m,sa}_{[0,1]}$ and $t \in [0,1]$,

$E\Big[\exp\Big\{N^2\Big(E[\sigma^N(P)|\mathcal{F}_t] - E[\sigma^N(P)] - \frac{1}{2} \int_0^t \mathrm{tr}\big(E[\nabla_s P|\mathcal{F}_s]^2\big)\,ds\Big)\Big\}\Big] = 1$.
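The step from the bracket to the normalization identity, and from there to an upper bound, can be sketched as follows. This uses only the standard fact that $\exp(\lambda M_t - \frac{\lambda^2}{2}\langle M \rangle_t)$ has expectation one for a continuous martingale $M$ with bounded bracket; the shorthands $A_t(P)$, $Z_t(P)$ and the event $\Gamma$ are introduced here for illustration and are not notation from the source.

```latex
\begin{align*}
% shorthand for the integrated squared Malliavin derivative
A_t(P) &:= \sum_{l=1}^m \int_0^t \mathrm{tr}\big(E[\nabla^l_s P \mid \mathcal{F}_s]^2\big)\,ds,
\qquad \langle M^N_P \rangle_t = N^{-2} A_t(P), \\
% exponential martingale with parameter \lambda = N^2
1 &= E\Big[\exp\Big(\lambda M^N_P(t) - \tfrac{\lambda^2}{2}\langle M^N_P \rangle_t\Big)\Big]
   = E\Big[\exp\Big(N^2\big(M^N_P(t) - \tfrac12 A_t(P)\big)\Big)\Big], \\
% Chebyshev on an event \Gamma, with Z_t(P) := M^N_P(t) - A_t(P)/2
P(\Gamma) &= E\big[\mathbf{1}_\Gamma\, e^{-N^2 Z_t(P)}\, e^{N^2 Z_t(P)}\big]
   \le \exp\Big(-N^2 \operatorname*{ess\,inf}_{\Gamma} Z_t(P)\Big).
\end{align*}
```

The scaling $\lambda = N^2$ is what makes the quadratic term of order $N^2$ as well: $\frac{\lambda^2}{2} N^{-2} A_t(P) = \frac{N^2}{2} A_t(P)$. The remaining work, which is the content of the proposition that follows, is to replace the random quantity $Z_t(P)$ on $\Gamma$ by its deterministic counterpart expressed through $\tau$.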

To deduce the large deviation upper bound from this result, we need to show 
that 
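Proposition 7.10. For $P \in \mathcal{F}^m_{[0,1]}$, $\tau \in \mathcal{M}^{c,\infty}(\mathcal{F}^m_{[0,1]})$, $\epsilon > 0$, $l \in \mathbb{N}$, $L \in \mathbb{R}^+$,
define

$H := H(P, L, \epsilon, N, \tau, l) = \operatorname{ess\,sup}_{\{d(\sigma^N, \tau) < \epsilon;\ \sigma^N \in K_L \cap \Gamma_L\}} \big|\mathrm{tr}\big(E[P(U^N)|\mathcal{F}_t]^l\big) - \tau\big(\tau^t(P|\mathcal{B}_t)^l\big)\big|$;

then one has, for every $l \in \mathbb{N}$,

$\sup_{L > 0} \limsup_{\epsilon \to 0} \limsup_{N \to \infty} \sup_{\tau \in K_L \cap \Gamma_L,\ t \in [0,1]} H = 0$. (7.4.9)

Here $\{K_L \cap \Gamma_L\}_{L \in \mathbb{N}}$ are compact subsets of $\mathcal{M}(\mathcal{F}^m_{[0,1]})$ such that

$\limsup_{L \to \infty} \limsup_{N \to \infty} \frac{1}{N^2} \log P\big(\sigma^N \in (K_L \cap \Gamma_L)^c\big) = -\infty$.

Then, we can apply exactly the same techniques as in Chapter 4 to prove
the upper bound.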

To obtain the lower bound, we have to obtain uniqueness criteria for equations
of the form

$\tau^t(P) - \tau^0(P) = \int_0^t \tau\big(\tau^s(\nabla_s P|\mathcal{B}_s)\, \tau^s(\nabla_s K|\mathcal{B}_s)\big)\,ds$

with fields $K$ as general as possible. We proved in [18], Theorem 6.1, that if
$K \in \mathcal{F}^{m,sa}_{[0,1]}$, the solutions to this equation are strong solutions in the sense that
there exists a free Brownian motion $S$ such that $\tau$ is the law of the operator $X$
satisfying

$dX_t = dS_t + \tau^t(\nabla_t K|\mathcal{B}_t)(X)\,dt$.

But, if $K \in \mathcal{F}^{m,sa}_{[0,1]}$, it is not hard to see that $\tau^t(\nabla_t K|\mathcal{B}_t)(X)$ is a Lipschitz operator,
so that there exists a unique such operator $X$, implying the
uniqueness of the solution of our free differential equation, and hence the large
deviation lower bound.



7.5. Discussion and open problems 

Note that we have the following heuristic description of $\chi^*$ and $\chi^{**}$:

$\chi^*(\tau) = -\inf\Big\{ \int_0^1 \mu_t(K_t^2)\,dt \Big\}$

where the infimum is taken over all laws $\mu$ of non-commutative processes which
are null operators at time 0, operators with law $\tau$ at time one and which are
the distributions of 'weak solutions' of

$dX_t = dS_t + K_t(X)\,dt$.

$\chi^{**}$ is defined similarly but the infimum is restricted to processes with smooth
fields $K$ (actually $K \in \mathcal{F}^{m,sa}_{[0,1]}$). We then have proved in Theorem 7.8 that

$\chi^{**} \le \chi \le \chi^*$

and it is legitimate to ask when $\chi^{**} = \chi^*$. Such a result would show $\chi = \chi^*$.
Note that in the classical case, the relative entropy can actually be described
by the above formula by replacing the free Brownian motion by a standard
Brownian motion, and then all the inequalities become equalities.
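For comparison, the classical statement alluded to here can be written out. The block below is the standard Girsanov computation, added as an illustration under the assumption that the drift $K$ is adapted and satisfies an integrability condition of Novikov type; $W$ denotes Wiener measure, $Q$ the law of the solution and $B^Q$ the driving Brownian motion under $Q$, all of which are notation introduced for this sketch.

```latex
\begin{align*}
% classical analogue of dX_t = dS_t + K_t(X) dt
dX_t &= dB^Q_t + K_t(X)\,dt, \qquad X_0 = 0, \\
% Girsanov density of Q with respect to Wiener measure W on path space
\log \frac{dQ}{dW}(X) &= \int_0^1 K_t\,dX_t - \frac12 \int_0^1 |K_t|^2\,dt
   = \int_0^1 K_t\,dB^Q_t + \frac12 \int_0^1 |K_t|^2\,dt, \\
% take the Q-expectation; the stochastic integral is centered
H(Q \mid W) &= E_Q\Big[\log \frac{dQ}{dW}\Big] = \frac12\, E_Q\Big[\int_0^1 |K_t|^2\,dt\Big].
\end{align*}
```

In this classical formula the energy of the drift carries a factor $\frac12$; whether such a constant is absorbed into the heuristic formula for $\chi^*$ depends on normalizations that are not recoverable from the text above.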
This raises numerous questions:

1. First, inequalities (7.4.6) and (7.4.8) become equalities if $\mu^b \in \mathcal{M}^{c,\infty}(\mathcal{F}^m_{[0,1]})$,
that is if there exist $n$, times $(t_i, 1 \le i \le n+1) \in [0,1]^{n+1}$ and polynomial
functions $(Q_i, 1 \le i \le n)$ and $P$ such that

$D^*1 = \mathcal{J}^{\mu} = \sum_{i=1}^n 1_{[t_i, t_{i+1}]} Q_i + \tau^t(\nabla_t P|\mathcal{B}_t)$.

Can we find non trivial $\mu \in \mathcal{M}^{(m)}$ such that this is true?

2. If we follow the ideas of the earlier chapter to improve the lower bound, we would
like to regularize the laws by free convolution by free Cauchy variables
$C^\epsilon = (C^\epsilon_1, \ldots, C^\epsilon_m)$ with covariance $\epsilon$. If $X = (X_1, \ldots, X_m)$ is a process
satisfying

$dX_t = dS_t + K_t(X_t)\,dt$,

for some non-commutative function $K_t$, it is easy to see that $X^\epsilon = X + C^\epsilon$
satisfies the same free Fokker-Planck equation with
$K^\epsilon_t(X_t + C^\epsilon) = \tau(K_t(X_t)|X_t + C^\epsilon)$. Is $K^\epsilon$ then smooth with respect to the operator
norm? This is what we proved for one operator in [65]. If this is true in
higher dimension, then Connes' question is answered positively since, by
a Picard argument,

$dX^\epsilon_t = dS_t + K^\epsilon_t(X^\epsilon_t)\,dt$

has a unique strong solution and there exists a smooth function $F^\epsilon_t$ such
that for any $t > 0$

$X^\epsilon_t = F^\epsilon_t(S_s, s \le t)$.

In particular, for any polynomial function $P \in \mathbb{C}\langle X_1, \ldots, X_m \rangle$,

$\mu(P(X + C^\epsilon)) = \sigma(P \circ F^\epsilon_1(S_s, s \le 1)) = \lim_{N \to \infty} \mathrm{tr}(P \circ F^\epsilon_1(H^N_s, s \le 1))$

where we used in the last equality the smoothness of $F^\epsilon_1$ as well as the
convergence of the Hermitian Brownian motion towards the free Brownian
motion. Hence, since $\epsilon$ is arbitrary, we can approximate $\mu$ by the
empirical distribution of the matrices $F^\epsilon_1(H^N_s, s \le 1)$, which would answer
Connes' question positively. As in Remark 7.2, the only way to complete the
argument without dealing with Connes' question would be to be able to
prove such a regularization property only for laws with finite entropy, but
it is rather unclear how such a condition could enter into the game. This
could only be true if the hyperfinite factor had specific analytical
properties.

3. If we think that the only point of interest is what happens at time one,
then we can restrict the preceding discussion as follows: if $X = (X_1, \ldots, X_m)$
are non-commutative variables with law $\mu$, $(\mathcal{J}^{\mu}_i, 1 \le i \le m)$ is the
Hilbert transform of $\mu$, and $\mu^\epsilon$ is the law of $X + C^\epsilon$,
then we would like to show that $(\mathcal{J}^{\mu^\epsilon}_i, 1 \le i \le m)$ is smooth for $\epsilon > 0$.
In the case $m = 1$, $\mathcal{J}^{\mu^\epsilon}$ is analytic in $\{|\Im(z)| < \epsilon\}$. The generalization to
higher dimension is wide open.

4. A related open question posed by D. Voiculescu |12H) (in a paragraph 
entitled Technical problems) could be to try to show that the free convolution
acts smoothly on Fisher information in the sense that
$t \in \mathbb{R}^+ \mapsto \tau_{X+tS}(|\mathcal{J}_{X+tS}|^2)$ is continuous.

5. A different approach to microstates entropy could be to study the generating
functions $\Lambda(P)$ given, for $P \in \mathbb{C}\langle X_1, \ldots, X_m \rangle \otimes \mathbb{C}\langle X_1, \ldots, X_m \rangle$,
by

$\Lambda(P) = \lim_{R \to \infty} \limsup_{N \to \infty} \frac{1}{N^2} \log \int_{\|X^N_i\| \le R} e^{\mathrm{Tr} \otimes \mathrm{Tr}(P(X^N_1, \ldots, X^N_m))} \prod_{i=1}^m d\mu^N(X^N_i)$.

It is easy to see (and written down in [67]) that

$\chi(\tau) = \inf_{P \in \mathbb{C}\langle X_1, \ldots, X_m \rangle \otimes \mathbb{C}\langle X_1, \ldots, X_m \rangle} \{\Lambda(P) - \tau \otimes \tau(P)\}$.

Reciprocally,

$\Lambda(P) = \sup_{\tau \in \mathcal{M}^{(m)}} \{\chi(\tau) + \tau \otimes \tau(P)\}$.

Therefore, we see that the understanding of the first order of all matrix
models is equivalent to that of $\chi$. In particular, the convergence of all
of their free energies would allow one to replace the limsup in the definition
of the microstates entropy by a liminf, which would already be a great
achievement in free entropy theory. Note also that in the usual proof of
Cramér's theorem for commutative variables, the main point is to show
that one can restrict the supremum over the polynomial functions
$P \in \mathbb{C}\langle X_1, \ldots, X_m \rangle^{\otimes 2}$ to polynomial functions in $\mathbb{C}\langle X_1, \ldots, X_m \rangle$ (i.e. take
linear functions of the empirical distribution). This cannot be the case
here since this would entail that the microstates entropy is convex, which
it cannot be according to D. Voiculescu [124], who actually proved that if
$\tau \neq \tau' \in \mathcal{M}^{(m)}$ with $m \ge 2$, $\tau$ and $\tau'$ having finite microstates entropy,
then $\alpha\tau + (1 - \alpha)\tau'$ has infinite entropy for $\alpha \in (0, 1)$.
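The convexity obstruction in the last item can be spelled out in a few lines. The sketch below is added for illustration and assumes the duality formula for $\chi$ as an infimum over $P$; with that sign convention the property at stake is concavity of $\chi$ (the text speaks of convexity), and "infinite entropy" is read as $\chi = -\infty$. The symbol $\chi_{\mathrm{lin}}$ is hypothetical notation for the infimum restricted to linear test functions.

```latex
\begin{align*}
% restricting to linear test functions P = Q \otimes 1, so that \tau\otimes\tau(P) = \tau(Q)
\chi_{\mathrm{lin}}(\tau) &:= \inf_{Q \in \mathbb{C}\langle X_1,\ldots,X_m\rangle}
   \big\{\Lambda(Q \otimes 1) - \tau(Q)\big\}, \\
% each map \tau \mapsto \Lambda(Q\otimes 1) - \tau(Q) is affine; an infimum of affine maps is concave
\chi_{\mathrm{lin}}\big(\alpha\tau + (1-\alpha)\tau'\big)
   &\ge \alpha\,\chi_{\mathrm{lin}}(\tau) + (1-\alpha)\,\chi_{\mathrm{lin}}(\tau'),
   \qquad \alpha \in (0,1), \\
% Voiculescu's result for m \ge 2 and \tau \ne \tau' with finite entropy
\chi\big(\alpha\tau + (1-\alpha)\tau'\big) &= -\infty
   < \alpha\,\chi(\tau) + (1-\alpha)\,\chi(\tau').
\end{align*}
```

Hence $\chi$ cannot coincide with $\chi_{\mathrm{lin}}$: the quadratic dependence $\tau \otimes \tau(P)$ on the state is genuinely needed, in contrast with the commutative Cramér setting.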